Target for therapy of cognitive impairment

ABSTRACT

The invention relates to methods of identifying genes involved in cognitive impairment and compositions for treating cognitive impairment.

2. CROSS REFERENCE TO RELATED APPLICATIONS

[0001] This application claims the benefit of U.S. application Ser. No. 60/413,152, filed Nov. 22, 2002, which is incorporated by reference in its entirety.

1. GOVERNMENT SUPPORT

[0002] This invention was made with government support under grant No. PO1 AG09973 awarded by the National Institutes on Aging. The government may have certain rights in the invention.

3. BACKGROUND OF THE INVENTION

[0003] As the understanding of cognitive impairment increases so does the necessity to develop sensitive methods to detect the impairment and treatments for the impairment. There are many conditions, such as dementias (e.g. Lewy body dementia, vascular dementia, Alzheimer's Disease, and HIV associated dementia), Huntington's Disease, Parkinson's Disease, schizophrenia, depression, amyotrophic lateral sclerosis, Mild Cognitive Impairment (MCI) and Age Related Cognitive Decline (ARCD), of which sensitive detection of cognitive impairment would benefit the sufferer of the condition.

[0004] A major risk factor for a variety of conditions with cognitive impairment (such as Lewy body dementia, vascular dementia, Alzheimer's Disease, HIV associated dementia, Huntington's Disease, Parkinson's Disease, amyotrophic lateral sclerosis, MCI and ARCD) is aging. Individuals with these conditions have cognitive symptoms that increase in severity over the course of the disease. The effect of aging itself on cognition, in the absence of such disease, is important for defining the boundary between illness and normal aging. At the same time, the effects of aging on cognition may interact with the disease process in neurodegenerative illnesses, in determining vulnerability, rate of progression or other features of illness.

[0005] An important resource for developing detection methods and treatments for cognitive impairments includes using laboratory animals. Features that characterize cognitive impairments in animal models likely extend to cognitive impairments in humans. In the context of age-related cognitive impairments, extensive behavioral characterization has identified a naturally occurring form of cognitive impairment in an outbred strain of aged Long-Evans rats (Charles River Laboratories; Gallagher M, et al., Behav. Neurosci. 107:618-626; 1993). This model of cognitive aging uses animals that are maintained pathogen-free throughout their lives. Tests of physiological function and necropsies performed on all aged rats are used to exclude animals with conditions that would confound the study of aging with illness or disease. An important feature of this model is that it mirrors the phenomenon of variability in cognitive decline among elderly humans. Furthermore, the individual differences in cognitive decline in aged rats in this model are seen in a behavioral assessment that is sensitive to the function of interconnected structures in the medial temporal lobe, a system that is essential for declarative memory in humans.

[0006] Another important feature of this model is that it is directed to understanding multiplicity of genes that contribute to age-associated cognitive impairment. The genetic contribution to age-associate cognitive impairment is not likely to be monogenic, meaning caused by a deletion or mutation in a single gene. Monogenic diseases are very rare and typically affect the young. Because of their severity, monogenic diseases are frequently inconsistent with attainment of average life expectancy. In humans, the vast majority of common but serious conditions affect the adult population, increase in frequency and severity with increasing chronological age and cannot be attributed to a single gene (see for example, Hegele RA. Trends Endocrinol Metab. 2003 8:371-377; Shih D Q, et al. Curr Diab Rep. 2002 2:125-134; Barlassina C, et al. J Am Soc Nephrol. 2002 Suppl 3:S155-S164). Accumulating evidence suggests that the genetic component of maturity onset or aging-associated conditions reflects more subtle changes in expression of multiple genes than the absolute deficiency or dramatic gain of function underlying monogenic disease. The challenge in defining the molecular basis of these age-associated conditions is to identify the multiplicity of genes and establish if the relatively small changes in expression of a defined group of genes indeed associate with or lead to condition in an outbred population such as the human population. Thus, using a mammalian outbred model of aging facilitates the analysis of the relationship between levels of expression of multiple genes within the hippocampus and learning ability, in out-bred young and aged subjects.

[0007] In a behavioral assessment with the Morris Water Maze (MWM), rats learn and remember the location of an escape platform guided by a configuration of spatial cues surrounding the maze. The cognitive basis of performance is tested in probe trials using measures of the animal's spatial bias in searching the location of the escape platform. Aged rats in the study population have no difficulty swimming to a visible platform, but an age-dependent impairment is detected when the platform is camouflaged, requiring the use of spatial information. As reported in many publications, performance for individual aged rats in the outbred Long-Evans strain varies greatly, with a proportion of those rats performing on a par with young adults but approximately 40-50% falling outside the range of young performance (Gallagher et al. Behav. Neurosci. 107: 618-626, 1993). This variability among aged rats reflects reliable individual differences. Thus, within the aged population some animals are cognitively impaired and designated aged impaired (AI). Other aged animals are cognitively unimpaired, or aged unimpaired (AU).

[0008] In a reassessment using the MWM in a new spatial environment several weeks after the original characterization, the AI animals are consistently impaired, whereas the AU animals again perform proficiently (Colombo et al. Proc. Natl. Acad. Sci. 94: 14195-14199, 1997). The difference in cognitive ability in the MWM assessment for AI and AU rats is reliable even over an interval of 3 months (Gallagher and Burwell, Neurobiol. Aging 10: 691-708, 1989). Further, AI and AU characterization in the MWM differentiates the performance of the same aged subjects in other behavioral tasks that require the same cognitive function, such as the Barnes circular maze (Gallagher and Burwell Neurobiol. Aging 10: 691-708, 1989), and the radial arm maze (RAM). This naturally occurring impairment in an aged population of rodents indicates that cognitive aging is not inevitable or strictly linked to chronological age, and, importantly, it affords the opportunity to compare the trajectory of changes in the brain that lead to decline or preserved memory. Additional background research using this model indicates that cognitive impairment occurs independent of neurodegeneration involving loss of neurons or widespread degradation of relevant circuits (Rapp and Gallagher Proc. Natl. Acad. Sci. 93: 9926-9930, 1996). Thus, this model is likely to be a more sensitive test of cognitive aging than preparations intended to measure the effects of neuron loss.

[0009] In addition to reliability, the cognitive assessment used in this model has proven sensitive to effects of aging on relevant brain systems. Significant biological differences have been shown to occur in AU and AI rats within neural circuits that are critical for the cognitive function assessed in the MWM. For example, the neurons in the hippocampus have a reduced response to certain chemical transmitters, such as acetylcholine and glutamate, in AI rats as compared to both AU and young rats (Nicolle et al. J. Neurosci. 19: 9604-9610, 1999). In a study of the anatomical distribution of glutamate receptor subtypes, the use of this model revealed a decrease in kainate binding in the CA3 region of the hippocampus that was confined to aged unimpaired rats and differed from both young and aged impaired (Nicolle et al. Neuroscience 74: 741-756, 1996). There is a need for a greater understanding of the biological and genetic basis of cognitive impairment.

4. SUMMARY OF THE INVENTION

[0010] In one aspect, the invention features a method of identifying a gene associated with a desired behavior of a subject, such as a mammal, comprising providing a test population of subjects having the desired behavior, providing a control population of subjects lacking the desired behavior, isolating and pooling expressed RNA from neural tissue, such as the hippocampus, of the test and control populations, respectively, determining the level of expression of a plurality of genes in each of the control and test RNA pools and selecting a gene from the plurality of genes, the expression of which differs between the test population and the control population of mammals. The selected gene is a candidate gene associated with a desired behavior. The level of expression of the plurality of genes may be detected by any appropriate means, such as microarray analysis, in situ hybridization histochemistry, quantitative PCR, SAGE analysis, Northern blot analysis or dot blot analysis, or by appropriate methods of measuring protein levels, including Western blot, protein slot blot or protein arrays. The plurality of genes may comprise genes involved in glutamate transport, such as EAAT1, EAAT2, EAAT3, EAAT4 and EAAT5, genes other than the glutamate transporters EAAT1, EAAT2, EAAT3, EAAT4 and EAAT5 or genes involved in the catabolism of glutamate in the synaptic cleft and/or extrasynaptic space between neurons, such as aspartate aminotransferase. Preferably, the gene selected from the plurality of genes exhibits increased levels of expression. Alternatively, the gene selected may exhibit decreased levels of expression.

[0011] In another aspect, the invention features a method of identifying a gene associated with a cognitive function of a subject comprising providing a test population of mammals having the desired cognitive function, providing a control population of mammals lacking the desired cognitive function, isolating and pooling expressed RNA from neural tissue, such as the hippocampus, of the test and control populations, respectively, determining the level of expression of a plurality of genes in each of the control and test RNA pools and selecting a gene from the plurality of genes, the expression of which differs between the test population and the control population of mammals. The selected gene is a candidate gene associated with a desired cognitive function. The level of expression of the plurality of genes may be measured by any appropriate means, such as microarray analysis, in situ hybridization histochemistry, quantitative PCR, SAGE analysis, Northern blot analysis or dot blot analysis, or by appropriate methods of measuring protein levels, including Western blot, protein slot blot or protein arrays. The plurality of genes may comprise genes involved in glutamate transport, such as EAAT1, EAAT2, EAAT3, EAAT4 and EAAT5, genes other than the glutamate transporters EAAT1, EAAT2, EAAT3, EAAT4 and EAAT5 or genes involved in the catabolism of glutamate in the synaptic cleft and/or extrasynaptic space between neurons, such as aspartate aminotransferase. Preferably, the gene selected from the plurality of genes exhibits increased levels of expression. Alternatively, the gene selected may exhibit decreased levels of expression.

[0012] Another aspect of the invention involves a method of screening compounds for utility in promoting cognitive function comprising administering a test compound to a subject, such as a mammal, determining the level of expression of a gene in neural tissue, such as the hippocampus, of said subject following administration of said test compound, comparing said level of expression of said gene to a reference level of expression thereof in neural tissue of a subject to whom said test compound was not administered and determining whether the level of expression of said gene differs from the corresponding reference level of expression thereof, wherein said difference indicates that the test compound is a candidate therapeutic agent for promoting cognitive function. The test compound may be a small molecule, such as but not limited to those found in formula I, II or III. Further, the method may comprise comparing the level of expression of said gene to a reference level of expression thereof in neural tissue of a subject to whom ceftriaxone was administered. The level of expression of the gene may be detected by any appropriate means, such as microarray analysis, in situ hybridization histochemistry, quantitative PCR, SAGE analysis, Northern blot analysis or dot blot analysis, or by appropriate methods of measuring protein levels, including Western blot, protein slot blot or protein arrays. The gene may be involved in glutamate transport, such as EAAT1, EAAT2, EAAT3, EAAT4 and EAAT5 or may be involved in the catabolism of glutamate in the synaptic cleft and/or extrasynaptic space between neurons, such as aspartate aminotransferase. Preferably, the gene selected from the plurality of genes exhibits increased levels of expression. Alternatively, the gene selected may exhibit decreased levels of expression.

[0013] Another aspect of the invention involves a method of screening compounds for utility in promoting cognitive function comprising administering a test compound to a subject, such as a mammal, determining the level of expression of a glutamate transporter gene in neural tissue, such as the hippocampus, of said subject following administration of said test compound, comparing said level of expression of said gene to a reference level of expression thereof in neural tissue of a subject to whom said test compound was not administered and determining whether the level of expression of said gene differs from the corresponding reference level of expression thereof, wherein said difference indicates that the test compound is a candidate therapeutic agent for promoting cognitive function. The test compound may be a small molecule, such as but not limited to those found in formula I, II or III. The level of expression of the gene may be detected by any appropriate means, such as microarray analysis, in situ hybridization histochemistry, quantitative PCR, SAGE analysis, Northern blot analysis or dot blot analysis, or by appropriate methods of measuring protein levels, including Western blot, protein slot blot or protein arrays. Preferably, the gene selected from the plurality of genes exhibits increased levels of expression. Alternatively, the gene selected may exhibit decreased levels of expression.

[0014] A method of screening compounds for utility in promoting cognitive function in a subject, such as a mammal, comprising the steps of contacting a test compound with a cell expressing a gene listed in FIG. 4, e.g., a glutamate transporter gene EAAT1, 2, 3, 4 or 5, aspartate aminotransferase or pituitary adenyl cyclase activator polypeptide (PACAP), and determining whether the level of expression of said gene is changed by contact of said cell with said test compound, said change if present being indicative of the ability of said compound to promote cognitive function in a subject, such as a mammal, in need thereof. The compound may be a small molecule, such as those found in formula I, II or III. The cell may be derived from neural tissue, such as culturered neurons, cultured glia or primary neuronal culture; or may be an immortalized cell, a neuronal cell line, glial cell line or an astrocyte cell line. Preferably, the gene selected from the plurality of genes exhibits increased levels of expression. Alternatively, the gene selected may exhibit decreased levels of expression.

[0015] The test compound used in each of the above-mentioned aspects of the invention may be a small molecule, such as any of the third-generation cephalosporins (cefsulodin, cefotaxime, ceftizoxime, ceftriaxone, cefoperazone, moxalactam, and ceftazidime), valproic acid or MS-153. Further the test compound may activate gene expression, including the glutamate transporters selected from the group consisting of EAAT1, EAAT2, EAAT3, EAAT4 and EAAT5, or the aspartate aminotransferase gene. Alternatively, the test compound may be an inhibitor of gene expression.

[0016] In another aspect, the invention features a library comprising a plurality of cDNA sequences coding for genes that are differentially expressed in mammalian neural tissue upon preservation of cognitive function in a mammal. Preferably, a library comprises cDNA sequences coding for genes that are differentially expressed in neural tissue upon treatment of the mammal with ceftriaxone, valproic acid or MS-153. The library may contain cDNA sequences derived for a glutamate transporter gene, such as EAAT1, EAAT2, EAAT3, EAAT4 and EAAT5, or a sequence derived from aspartate aminotransferase. The library containing cDNA at least 20%, 50% or 80% of sequences derived from a glutamate transpoter gene.

[0017] Another aspect of the invention is a microarray chip comprising a solid support having attached thereto, at individually addressed locations, cDNA sequences corresponding to members of the above-mentioned cDNA library, such as those cDNA sequences that are differentially expressed in neural tissue upon preservation of cognitive function in a subject or upon treating the subject with ceftriaxone or valproic acid. Members of the microarray chip include either a glutamate transporter sequence selected from the group consisting of EEAT1, EEAT2, EEAT3, EEAT4, and EEAT or an aspartate aminotransferase sequence.

[0018] The invention also features a pharmaceutical composition comprising a therapeutically effective amount of a compound that stimulates neural tissue expression of a gene listed in FIG. 4, e.g., a glutamate transporter gene EAAT1, 2, 3, 4 or 5, aspartate aminotransferase or pituitary adenyl cyclase activator polypeptide (PACAP). The pharmaceutical composition may further comprise a small molecule.

[0019] In yet another aspect, the invention features a pharmaceutical composition comprising a therapeutically effective amount of formula I, II or III. Alternative, the pharmaceutical composition may comprise a therapeutically effective amount of a compound other than ceftriaxone or valproic acid, that was identified by a method screening compounds for utility in promoting cognitive function by administering compounds to subjects, such as mammals, or cells and measuring differential gene expression between those subjects or cells with and without exposure to the compounds. These compounds are candidate compounds for promoting cognitive function.

[0020] Another aspect of the invention features a method for preserving cognitive function in a mammal, such as a human, or treating impaired cognitive function in a mammal, such as a human, by stimulating the neural tissue expression of a gene involved in either glutamate transport or glutamate catabolism in neural tissue. Further, preserving cognitive function in a mammal, such as a human, in need thereof comprises administering a pharmaceutical composition that that stimulates neural tissue expression of a gene listed in FIG. 4, e.g., a glutamate transporter gene EAAT1, 2, 3, 4 or 5, aspartate aminotransferase or pituitary adenyl cyclase activator polypeptide (PACAP).

[0021] The invention also features a method of preserving cognitive function in a mammal, such as a human, in need thereof comprises administering a pharmaceutical composition that is a small molecule of any one of the following formulas: I, II or III. For a method of preserving cognitive function in a mammal, such as a human, in need thereof comprises administering a compound of formula I, the mammal is free of symptoms of an infectious disease for which antibiotic treatment is indicated.

[0022] The invention also features promoting cognitive function in a mammal, such as a human, in need thereof, comprising administering to said mammal an amount of a pharmaceutical composition that stimulates neural tissue expression of a gene listed in FIG. 4 sufficient to promote the following cognitive functions: spatial memory acquisition, long term spatial memory or spatial memory retrieval. The invention also features preserving cognitive function or treating cognitive impairment in an aged mammal, such as a human, and treating impaired cognitive function in a mammal, such as a human, by administering a therapeutically effective amount of ceftriaxone or analogs or derivatives thereof, valproic acid or analogs or derivatives thereof or MS-153 or analogs or derivatives thereof to the mammal in need thereof. In the cases in which a mammal manifests impaired cognitive function the impaired cognitive function may be associated with one of the following conditions: mild cognitive impairment, age related cognitive decline, memory loss, senility or dementia. Additionally, cases in which a mammal manifests impaired cognitive function the impaired cognitive function may be associated Alzheimer's Disease.

[0023] Other features and advantages of the invention will be apparent based on the following Detailed Description and Claims.

5. BRIEF DESCRIPTION OF THE DRAWINGS

[0024]FIG. 1 is a graph depicting the behavioral characterization of young and aged rats in the MWM assessment.

[0025]FIG. 2 is a graph depicting the reliability between the initial MWM characterization for 10 aged rats and and their memory performance in the RAM

[0026]FIG. 3 is a table summarizing the distribution of mammalian glutamate transporters and their human homologues in various cell types found in brain tissue.

[0027]FIG. 4 is a table summarizing the expression of the EAAT2/GLT1, EAAT1/GLAST, and EAAT3/EEAC1 mRNAs in young (Y), aged-impaired (AI) and aged-unimpaired (AU) animals using the microarray.

[0028]FIG. 5 is a table summarizing the abundance of EAAT2/GLT1, EAAT1/GLAST, and EAAT3/EEAC1 mRNAs in young (Y), aged-impaired (AI) and aged-unimpaired (AU) animals using in situ hybridization histochemistry.

[0029]FIG. 6 is a graph depicting the reduction of memory errors in AI rats treated with ceftriaxone (daily injection of 200 mg/kg im, for 1 week).

6. DETAILED DESCRIPTION OF THE INVENTION

[0030] For convenience, certain terms employed in the specification, examples, and appended claims are collected here. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.

6.1 Definitions

[0031] The articles “a” and “an” are used herein to refer to one or to more than one (i.e., to at least one) of the grammatical object of the article. By way of example, an element means one element or more than one element.

[0032] “Aged” is used herein to refer to mammals at or near the end of their average life span. For example, an aged rat would be about 24-30 months of age. An aged human would be seventy or more years of age.

[0033] The term “aliphatic” is art-recognized and refers to a linear, branched, cyclic alkane, alkene, or alkyne. In certain embodiments, aliphatic groups in the present invention are linear or branched and have from 1 to about 20 carbon atoms.

[0034] The term “alkyl” is art-recognized, and includes saturated aliphatic groups, including straight-chain alkyl groups, branched-chain alkyl groups, cycloalkyl (alicyclic) groups, alkyl substituted cycloalkyl groups, and cycloalkyl substituted alkyl groups. In certain embodiments, a straight chain or branched chain alkyl has about 30 or fewer carbon atoms in its backbone (e.g., C₁-C₃₀ for straight chain, C₃-C₃₀ for branched chain), and alternatively, about 20 or fewer. Likewise, cycloalkyls have from about 3 to about 10 carbon atoms in their ring structure, and alternatively about 5, 6 or 7 carbons in the ring structure. The term “alkyl” is also defined to include halosubstituted alkyls.

[0035] The terms “amine” and “amino” are art-recognized and refer to both unsubstituted and substituted amines, e.g., a moiety that may be represented by the general formulas:

[0036] wherein R50, R51 and R52 each independently represent a hydrogen, an alkyl, an alkenyl, —(CH₂)_(m)—R61, or R50 and R51, taken together with the N atom to which they are attached complete a heterocycle having from 4 to 8 atoms in the ring structure; R61 represents an aryl, a cycloalkyl, a cycloalkenyl, a heterocycle or a polycycle; and m is zero or an integer in the range of 1 to 8. In certain embodiments, only one of R50 or R51 may be a carbonyl, e.g., R50, R51 and the nitrogen together do not form an imide. In other embodiments, R50 and R51 (and optionally R52) each independently represent a hydrogen, an alkyl, an alkenyl, or —(CH₂)_(m)—R61. Thus, the term “alkylamine” includes an amine group, as defined above, having a substituted or unsubstituted alkyl attached thereto, i.e., at least one of R50 and R51 is an alkyl group.

[0037] The term “acylamino” is art-recognized and refers to a moiety that may be represented by the general formula:

[0038] wherein R50 is as defined above, and R54 represents a hydrogen, an alkyl, an alkenyl or —(CH₂)_(m)—R61, where m and R61 are as defined above.

[0039] The term “amido” is art recognized as an amino-substituted carbonyl and includes a moiety that may be represented by the general formula:

[0040] wherein R50 and R51 are as defined above. Certain embodiments of the amide in the present invention will not include imides which may be unstable.

[0041] The term “alkylthio” refers to an alkyl group, as defined above, having a sulfur radical attached thereto. In certain embodiments, the “alkylthio” moiety is represented by one of —S-alkyl, —S-alkenyl, —S-alkynyl, and —S—(CH₂)_(m)—R61, wherein m and R61 are defined above. Representative alkylthio groups include methylthio, ethyl thio, and the like.

[0042] The term “aralkyl” is art-recognized and refers to an alkyl group substituted with an aryl group (e.g., an aromatic or heteroaromatic group).

[0043] The terms “alkenyl” and “alkynyl” are art-recognized and refer to unsaturated aliphatic groups analogous in length and possible substitution to the alkyls described above, but that contain at least one double or triple bond respectively.

[0044] Unless the number of carbons is otherwise specified, “lower alkyl” refers to an alkyl group, as defined above, but having from one to about ten carbons, alternatively from one to about six carbon atoms in its backbone structure. Likewise, “lower alkenyl” and “lower alkynyl” have similar chain lengths.

[0045] The terms “alkoxyl” or “alkoxy” are art-recognized and refer to an alkyl group, as defined above, having an oxygen radical attached thereto. Representative alkoxyl groups include methoxy, ethoxy, propyloxy, tert-butoxy and the like. An “ether” is two hydrocarbons covalently linked by an oxygen. Accordingly, the substituent of an alkyl that renders that alkyl an ether is or resembles an alkoxyl, such as may be represented by one of —O-alkyl, —O-alkenyl, —O-alkynyl, —O——(CH₂)_(m)—R61, where m and R61 are described above. “Analog” is used herein to refer to a compound which functionally resembles another chemical entity, but does not share the identical chemical structure thereof. For example, a ceftriaxone analog is sufficiently similar to ceftriaxone that it can substitute for the ceftriaxone in therapeutic applications, despite minor structural differences from the structure of ceftriaxone.

[0046] The terms “array” and “matrix” are used herein to refer to an arrangement of addressable locations or “addresses” on a device. The locations can be arranged in two dimensional arrays, three dimensional arrays, or other matrix formats. The number of locations can range from several to at least hundreds of thousands. Most importantly, each location represents a totally independent reaction site. A “nucleic acid array” refers to an array containing nucleic acid probes, such as oligonucleotides or larger portions of genes. The nucleic acid on the array may be single stranded¹. Arrays wherein the probes are oligonucleotides are referred to as “oligonucleotide arrays” or “oligonucleotide chips.” A “microarray,” also referred to herein as a “biochip,” “biological chip” or “gene array” is an array of regions having a density of discrete regions of at least about 100/cm², and preferably at least about 1000/cm². The regions in a microarray have typical dimensions, e.g., diameters, in the range of between about 10-250 μm, and are separated from other regions in the array by about the same distance.

[0047] “Aspartate aminotransferase” is used herein to refer to the enzyme (E.C. 2.6.1.1) that catalyzes the coversion of oxaloacetate and glutamate into aspartate and 2-oxoglutarate, and the nucleic acid and homologs (see for example, GenBank accession Nos.: BC000498 or XM_(—)062678) encoding amino acids with aspartate aminotransferase activity. Aspartate aminotransferase is involved in the catabolism of glutamate in the synaptic cleft and extrasynaptic space. Homologs of the foregoing are believed to exist in other mammals, including primates, canines, felines and rodents.

[0048] “Beta-arrestin 2” is used herein to refer to the intracellular scaffold/adapter proteins that facilitate the transmission of additional signals from activated G protein-coupled receptors. Additionally, these proteins are involved in the endocytosis of transmembrane receptor endocytosis. Beta-arrestin 2 also refers to the nucleic acids that encode the β-arrestin protein. Homologs of the foregoing are believed to exist in other mammals, including primates, canines, felines and rodents.

[0049] The term “carbocycle” is art-recognized and refers to an aromatic or non-aromatic ring in which each atom of the ring is carbon.

[0050] The term “carbonyl” is art recognized and includes such moieties as may be represented by the general formulas:

[0051] wherein X50 is a bond or represents an oxygen or a sulfur, and R55 and R56 represents a hydrogen, an alkyl, an alkenyl, —(CH₂)_(m)—R61 or a pharmaceutically acceptable salt, R56 represents a hydrogen, an alkyl, an alkenyl or —(CH₂)_(m)—R61, where m and R61 are defined above. Where X50 is an oxygen and R55 or R56 is not hydrogen, the formula represents an “ester”. Where X50 is an oxygen, and R55 is as defined above, the moiety is referred to herein as a carboxyl group, and particularly when R55 is a hydrogen, the formula represents a “carboxylic acid”. Where X50 is an oxygen, and R56 is hydrogen, the formula represents a “formate”. In general, where the oxygen atom of the above formula is replaced by sulfur, the formula represents a “thiolcarbonyl” group. Where X50 is a sulfur and R55 or R56 is not hydrogen, the formula represents a “thiolester.” Where X50 is a sulfur and R55 is hydrogen, the formula represents a “thiolcarboxylic acid.” Where X50 is a sulfur and R56 is hydrogen, the formula represents a “thiolformate.” On the other hand, where X50 is a bond, and R55 is not hydrogen, the above formula represents a “ketone” group. Where X50 is a bond, and R55 is hydrogen, the above formula represents an “aldehyde” group.

[0052] The term “chiral” is art-recognized and refers to molecules which have the property of non-superimposability of the mirror image partner, while the term “achiral” refers to molecules which are superimposable on their mirror image partner. A “prochiral molecule” is a molecule which has the potential to be converted to a chiral molecule in a particular process.

[0053] The term “cis” is art-recognized and refers to the arrangement of two atoms or groups around a double bond such that the atoms or groups are on the same side of the double bond. Cis configurations are often labeled as (Z) configurations.

[0054] “Cognitive function” is used herein to refer to higher order intellectual, brain processes involved in learning and memory, including, but not limited to, attention, acquisition, short-term memory, long-term memory and memory retrieval, and expressing an interest in one's surroundings and self-care. In animal model systems, cognitive function may be measured any number of ways known in the art, including using the following apparati: Morris water maze, Barnes circle maze, elevated radial arm maze, T maze or any other mazes in which subjects use spatial information. Other tests known in the art may be used to assess cognitive function, such as fear conditioning, active avoidance, illuminated open-field, dark activity meter, elevated plus-maze, two-compartment exploratory test or forced swimming test. In humans, cognitive function may be measured, without limitation, by the Alzheimer's Disease Assessment Scale-cognitive subscale (ADAS-cog); the clinical global impression of change scale (CIBIC-plus scale); the Alzheimer's Disease Cooperative Study Activities of Daily Living Scale (ADCS-ADL); the Mini Mental State Exam (MMSE); the Neuropsychiatric Inventory (NPI); the Clinical Dementia Rating Scale (CDR); the Cambridge Neuropsychological Test Automated Battery (CANTAB) or the Sandoz Clinical Assessment-Geriatric (SCAG). In addition, cognitive function may be measured using imaging techniques such as Positron Emission Tomography (PET), functional magnetic resonance imaging (fMRI), Single Photon Emission Computed Tomography (SPECT), or any other imaging technique that allows one to measure brain function.

[0055] “Promoting” cognitive function refers to affecting impaired cognitive function so that it more closely resembles the function of an aged-matched normal, unimpaired subject, and includes affecting states in which cognitive function is reduced, e.g., by about 10%, 30%, 50%, 75%, 90% or 95% as compared to a normal subject. Cognitive function may be promoted to any detectable degree, but preferably is promoted sufficiently to allow an impaired subject to carry out daily activities of normal life.

[0056] “Preserving” cognitive function refers to affecting normal or impaired cognitive function such that it does not decline or does not fall below that observed in the subject upon first presentation or diagnosis.

[0057] “Impaired cognitive function” refers to cognitive function that is not as robust as that observed in an age-matched normal subject and includes states in which cognitive function is reduced, e.g., by about 10%, 30%, 50%, 75%, 90% or 95% as compared to cognitive function measured in an age-matched normal subject. Impaired cognitive function may be associated with many diseases or disorders, involving dementias (e.g. Lewy body dementia, vascular dementia, Alzheimer's Disease, and HIV associated dementia), Huntington's Disease, Parkinson's Disease, schizophrenia, amyotrophic lateral sclerosis, Mild Cognitive Impairment (MCI) and Age Related Cognitive Decline (ARCD). Alternatively, impaired cognitive function may manifest in a subject that does not present with a diagnosable disease or disorder. For instance, impaired cognitive function may result from subtle metabolic, toxic, neurotoxic, iatrogenic, thermal or chemical changes in the subject. These subtle changes include without limitation, ischemia, hypoxia, cerebrovascular accident, trauma, surgery, pressure, mass effect, hemmorrhage, radiation, vasospasm, neurodegenerative disease or infection.

[0058] “Control population” is used herein to refer to mammals lacking a desired behavior associated with cognitive function, and usually includes mammals that are not young.

[0059] The term “covalent bond” is art-recognized and refers to a bond between two atoms where electrons are attracted electrostatically to both nuclei of the two atoms, and the net effect of increased electron density between the nuclei counterbalances the internuclear repulsion. The term covalent bond includes coordinate bonds when the bond is with a metal ion.

[0060] The terms “combinatorial library” or “library” are art-recognized and refer to a plurality of compounds, which may be termed “members,” synthesized or otherwise prepared from one or more starting materials by employing either the same or different reactants or reaction conditions at each reaction in the library. There are a number of other terms of relevance to combinatorial libraries (as well as other technologies). The term “identifier tag” is art-recognized and refers to a means for recording a step in a series of reactions used in the synthesis of a chemical library. The term “immobilized” is art-recognized and, when used with respect to a species, refers to a condition in which the species is attached to a surface with an attractive force stronger than attractive forces that are present in the intended environment of use of the surface, and that act on the species. The term “solid support” is art-recognized and refers to a material which is an insoluble matrix, and may (optionally) have a rigid or semi-rigid surface. The term “linker” is art-recognized and refers to a molecule or group of molecules connecting a support, including a solid support or polymeric support, and a combinatorial library member. The term “polymeric support” is art-recognized and refers to a soluble or insoluble polymer to which a chemical moiety can be covalently bonded by reaction with a functional group of the polymeric support. The term “functional group of a polymeric support” is art-recognized and refers to a chemical moiety of a polymeric support that can react with an chemical moiety to form a polymer-supported amino ester.

[0061] “Derivative” is used herein to refer to the chemical modification of a compound, e.g., a cephalosporin or valproic acid. Chemical modifications of a compound can include, for example, replacement of hydrogen by an alkyl, acyl, or amino group. Many other modifications are also possible. A derivative of a compound retains at least one fuinctional property of the original compound.

[0062] “Desired behavior” is used herein to refer to a behavioral manifestion of cognitive function as observed in a normal unimpaired subject. For example, in animals the desired behavior reflect the animals' cognitive function as measured on any one of a number of apparati, such as the Morris water maze, Barnes circle maze, elevated radial arm maze, T maze; or by any one of a number of tests, such as fear conditioning, active avoidance, illuminated open-field, dark activity meter, elevated plus-maze, two-compartment exploratory test or forced swimming test. In humans, the desired behavior reflect the subjects' cognitive function as measured by the ability of the subject to carry out daily activities of norman life or may be measured by performance on any number of tests for cognitive function including but not limited to ADAS-cog, CIBIC-plus scale, ADCS-ADL, MMSE, NPI, CDR, CANTAB or SCAG.

[0063] The term “heteroatom” is art-recognized and refers to an atom of any element other than carbon or hydrogen. Illustrative heteroatoms include boron, nitrogen, oxygen, phosphorus, sulfur and selenium.

[0064] The term “aryl” is art-recognized and refers to 5-, 6- and 7-membered single-ring aromatic groups that may include from zero to four heteroatoms, for example, benzene, pyrrole, fluran, thiophene, imidazole, oxazole, thiazole, triazole, pyrazole, pyridine, pyrazine, pyridazine and pyrimidine, and the like. Those aryl groups having heteroatoms in the ring structure may also be referred to as “heteroaryl.” The aromatic ring may be substituted at one or more ring positions with such substituents as described above, for example, halogen, azide, alkyl, aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, alkoxyl, amino, nitro, sulfhydryl, imino, amido, phosphonate, phosphinate, carbonyl, carboxyl, silyl, ether, alkylthio, sulfonyl, sulfonamido, ketone, aldehyde, ester, heterocyclyl, aromatic or heteroaromatic moieties, —CF₃, —CN, or the like. The term “aryl” also includes polycyclic ring systems having two or more cyclic rings in which two or more carbons are common to two adjoining rings (the rings are “fused rings”) wherein at least one of the rings is aromatic, e.g., the other cyclic rings may be cycloalkyls, cycloalkenyls, cycloalkynyls, aryls and/or heterocyclyls.

[0065] The terms ortho, meta and para are art-recognized and refer to 1,2-, 1,3- and 1,4-disubstituted benzenes, respectively. For example, the names 1,2-dimethylbenzene and ortho-dimethylbenzene are synonymous.

[0066] The terms “heterocyclyl” or “heterocyclic group” are art-recognized and refer to 3- to about 10-membered ring structures, alternatively 3- to about 7-membered rings, whose ring structures include one to four heteroatoms. Heterocycles may also be polycycles. Heterocyclyl groups include, for example, thiophene, thianthrene, fluran, pyran, isobenzofuran, chromene, xanthene, phenoxanthene, pyrrole, imidazole, pyrazole, isothiazole, isoxazole, pyridine, pyrazine, pyrimidine, pyridazine, indolizine, isoindole, indole, indazole, purine, quinolizine, isoquinoline, quinoline, phthalazine, naphthyridine, quinoxaline, quinazoline, cinnoline, pteridine, carbazole, carboline, phenanthridine, acridine, pyrimidine, phenanthroline, phenazine, phenarsazine, phenothiazine, furazan, phenoxazine, pyrrolidine, oxolane, thiolane, oxazole, piperidine, piperazine, morpholine, lactones, lactams such as azetidinones and pyrrolidinones, sultams, sultones, and the like. The heterocyclic ring may be substituted at one or more positions with such substituents as described above, as for example, halogen, alkyl, aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, amino, nitro, sulfhydryl, imino, amido, phosphonate, phosphinate, carbonyl, carboxyl, silyl, ether, alkylthio, sulfonyl, ketone, aldehyde, ester, a heterocyclyl, an aromatic or heteroaromatic moiety, —CF₃, —CN, or the like.

[0067] “Differentially expressed” is used herein to refer to the differing levels of expression, including both quantitative and qualitative measurements, of a gene of interest in tissues that have been treated differently or have been exposed to different environmental factors or changes in the physiological milieu.

[0068] “Gene” or “gene sequence” is used herein to refer to the partial or complete coding sequence of a gene, its compliment, and its 5′ or 3′ untranslated regions. The “coding sequence” of the gene is that set of nucleotides that are present in mRNA transcript of the gene. “Gene expression” refers to the process of making, or transcribing, an RNA based upon the DNA sequence of the gene. An “activator” of gene expression refers to a compound that stimulates the transcription of a gene's DNA sequence into a RNA transcript. “Endogenous” genes are genes naturally found within the species and not artificially incorporated, such as by random insertion or transfection, into the genome of an organism or cell.

[0069] “Glutamate transporter” is used herein to refer to transmembrane proteins that remove L-glutamate, the primary excitatory neurotransmitter in the mammalian central nervous system (CNS), from the extracellular space, including the synaptic cleft and extrasynaptic space. Glutamate transporters may be found in the membranes of both neurons and glial cells. Several glutamate transporters have been identified in humans and include, for example, Solute Carrier family 1, member 1 (SLC1A1 or EAAC1 or EAAT3; for example GenBank Accession No.:NM_(—)004170), Solute Carrier family 1, member 2 (SLC1A2 or EAAT2 or GLT1; for example GenBank Accession No.:NM_(—)004171), Solute Carrier family 1, member 3 (SLC1A3 or EAAT1, GLAST or GLAST1; for example GenBank Accession No.:NM_(—)004172), Solute Carrier family 1, member 6 (SLC1A6 or EAAT4; for example GenBank Accession No.:NM_(—)005071) and Solute Carrier family 1, member 7 (SLC1A7 or EAAT5; for example GenBank Accession No.:NM_(—)006671). Further, glutamate transporters have been identified in Rattus norvegicus and Mus musculus (Slc1a1/Eaac1/REAAC1, Slc1a2/GluT/GLT-1/GluT-R, Slc1a3/Eaat1/GLAST/GluT-1 and Slc1a6/Eaat4). Homologs of the foregoing are believed to exist in other mammals, including primates, canines, felines and rodents. The activity of a glutamate transporter protein is increased by administration of an agent that increases glutamate transporting activity of a glutamate transporter protein. Examples of agents reported to increase glutamate transport protein activativity include, for example, ((R)-(−)-5-methyl-1-nicotinoyl-2-pyrazoline (MS-153; Shimada et al., Eur J Pharmacol. 386:263-70, 1999); lidocaine (Do et al., Anesth Analg. 95:1263-8, 2002) and kinase inhibitors (e.g., Conradt, J Neurochem. 68:1244-51, 1997).

[0070] “Level of expression” of a gene is used herein to refer to the level of gene expression as measured by any method used to detect the presence of, a threshold amount of, a quantitative or qualitative measure of the expression of a gene- e.g. by measuring mRNA levels (e.g. by “Northern blot” or “microarray analysis”) or protein (e.g. by detecting the amount of full-length or a truncated polypeptide gene product (e.g. immunologically with an antibody)).

[0071] The term “meso compound” is art-recognized and refers to a chemical compound which has at least two chiral centers but is achiral due to a plane or point of symmetry.

[0072] “Metabotropic glutamate receptor” (mGluR) is used herein to refer to the G protein-coupled receptors that respond to the neurotransmitter glutamate. Based upon their primary sequence similarity, signal transduction linkages and pharmacological profile, there are three groups of mGluR's. Group I consists of mGluR1 (mGluR1a, mGluR1b, mGluR1c, mGluR1d; e.g., GenBank Accession number NM_(—)000838 for human splice variant mGluR1a) and mGluR5 (mGluR5a, mGluR5b; e.g., GenBank Accession number NM_(—)000842 for human splice variant mGluR5a) that are positively coupled to phospholipase C. Group II consists of mGluR2 (e.g., GenBank Accession number NM_(—)000839) and mGluR3 (e.g., GenBank Accession number NM_(—)000840) that are negatively linked to adenyl cyclase. Group II consists of mGluR4 (mGluR4a, mGluR4b; e.g., GenBank Accession number NM_(—)000841), mGluR6 (e.g., GenBank Accession number NM_(—)000843), mGluR7 (mGluR7a, mGluR7b; e.g., GenBank Accession number NM_(—)000844 for the human splice variant of mGluR7a) and mGluR8 (e.g., GenBank Accession number NM_(—)000845) that are negatively linked to adenyl cyclase. There are a number of commercially available agonists and antagonists for the various mGluR groups. For example, Group I agonists include but are not limited to L-quisqualic acid ((L)-(+)-α-amino-3,5-dioxo-1,2,4-oxadiazolidine-2-propanoic acid), (S)-3,5-dihydroxyphenylglycine ((S)-3,5-DHPG), trans-azetidine-2,4-dicarboxylic acid (tADA), (1S,3R)-1-aminocyclopentane-1,3-dicarboxylic acid ((1S,3R)-ACPD) and (RS)-2-Chloro-5-hydroxyphenylglycine (CHPG); and antagonists include but are not limited to (S)-4-carboxy-3-hydroxyphenylglycine ((S)-4C3HPG), 7-(hydroxyimino)cyclopropa[b]chromen-1a-carboxylate ethyl ester (CPCCOEt), (RS)-1 aminoindan-1,5-dicarboxylic acid (AIDA; UPF 523), 2-methyl-6-(phenylethynyl)pyridine (MPEP hydrochloride), 2-methyl-6-(2-phenylethenyl) pyridine (SIB-1893), 6-methyl-2-(phenylazo)-3-pyridinol (SIB-1757), and (S)-(+)-α-amino-4-carboxy-2-methylbenzeneacetic acid (LY 367385). Group II agonists include (2S,2′R,3′R)-2-(2′,3′-dicarboxycyclopropyl)glycine (DCG IV), (2S,1′S,2′S)-2-(carboxycyclopropyl)glycine (L-CCG-I; (2S,3S,4S)-CCG), (S)-3 carboxy-4-hydroxyphenylglycine ((S)-3C4HPG) and (2R,4R)-4-aminopyrrolidine-2,4-dicarboxylate ((2R,4R)-APDC); and antagonists include (2S)-α-Ethylglutamic acid (EGLU) and (2S)-2-amino-2-[(1S,2S)-2-carboxycycloprop-1-yl]-3-(xanth-9-yl) propanoic acid (LY 341495). Group III agonist include (1S,3R,4S)-1-aminocyclopentane-1,2,4-tricarboxylic acid (ACPT-I), L(+)-2-amino-4-phosphonobutyric acid (L-AP4), (R,S)-4-phosphonophenylglycine ((R,S)-PPG) and O-phospho-L-serine (L-SOP); and antagonists include (RS)-α-Cyclopropyl-4-phosphonophenylglycine (CPPG), (S)-2-amino-2-methyl-4-phosphonobutanoic acid (MAP4) and (RS)-α-Methylserine-O-phosphate (MSOP). Recent evidence has shown that metabotropic glutamate receptors associated with glia can alter the expression of glutarnate transporters (Aronica et al., Eur. J. Neurosci. 2003; 17: 2106-18, 2003).

[0073] “Middle-age” is used herein to refer to a mammal that is past the age of sexual maturity, i.e., not young, but not yet approaching the average life span of the species, i.e., not aged. For example, a middle-aged rat would be of about 12-18 months of age. A middle-aged human would be of between twenty and seventy years of age.

[0074] “Neural tissue” is used herein to refer to tissues of the nervous system, ie., tissues comprising both neurons and glia. Where specified, neural tissue may refer to particular structures found in the brain, including “hippocampal tissue.” Hippocampal tissue refers to the seahorse shaped structure found in the temporal cortex that includes the following: entorhinal cortex, presubiculum, subiculum, prosubiculum, dentate gyrus, and areas known as CA1, CA2, CA3 and CA4. The hippocampus is involved in processes such as short-term memory, the formation of long-term memory, memory retrieval, declarative memory and spatial navigation.

[0075] “Neuroprotective” is used herein to refer to compositions and treatments that have the effect of reducing, arresting or ameliorating impaired cognitive function, and protecting, resuscitating, or reviving nervous tissue that has suffered impaired cognitive function.

[0076] The term “nitro” is art-recognized and refers to —NO₂; the term “halogen” is art-recognized and refers to —F, —Cl, —Br or —I; the term “sulfflydryl” is art-recognized and refers to —SH; the term “hydroxyl” means —OH; and the term “sulfonyl” is art-recognized and refers to —SO₂ ⁻. “Halide” designates the corresponding anion of the halogens, and “pseudohalide” has the definition set forth on 560 of “Advanced Inorganic Chemistry” by Cotton and Wilkinson.

[0077] The term “phosphoryl” is art-recognized and may in general be represented by the formula:

[0078] wherein Q50 represents S or O, and R59 represents hydrogen, a lower alkyl or an aryl. When used to substitute, e.g., an alkyl, the phosphoryl group of the phosphorylalkyl may be represented by the general formulas:

[0079] wherein Q50 and R59, each independently, are defined above, and Q51 represents O, S or N. When Q50 is S, the phosphoryl moiety is a “phosphorothioate”.

[0080] The term “phosphoramidite” is art-recognized and may be represented in the general formulas:

[0081] wherein Q51, R50, R51 and R59 are as defined above.

[0082] The term “phosphonamidite” is art-recognized and may be represented in the general formulas:

[0083] wherein Q51, R50, R51 and R59 are as defined above, and R60 represents a lower alkyl or an aryl.

[0084] Analogous substitutions may be made to alkenyl and alkynyl groups to produce, for example, aminoalkenyls, aminoalkynyls, amidoalkenyls, amidoalkynyls, iminoalkenyls, iminoalkynyls, thioalkenyls, thioalkynyls, carbonyl-substituted alkenyls or alkynyls. The definition of each expression, e.g. alkyl, m, n, and the like, when it occurs more than once in any structure, is intended to be independent of its definition elsewhere in the same structure.

[0085] The term “selenoalkyl” is art-recognized and refers to an alkyl group having a substituted seleno group attached thereto. Exemplary “selenoethers” which may be substituted on the alkyl are selected from one of —Se-alkyl, —Se-alkenyl, —Se-alkynyl, and —Se—(CH₂)_(m)—R61, m and R61 being defined above.

[0086] The terms triflyl, tosyl, mesyl, and nonaflyl are art-recognized and refer to trifluoromethanesulfonyl, p-toluenesulfonyl, methanesulfonyl, and nonafluorobutanesulfonyl groups, respectively. The terms triflate, tosylate, mesylate, and nonaflate are art-recognized and refer to trifluoromethanesulfonate ester, p-toluenesulfonate ester, methanesulfonate ester, and nonafluorobutanesulfonate ester functional groups and molecules that contain said groups, respectively.

[0087] The abbreviations Me, Et, Ph, Tf, Nf, Ts, and Ms represent methyl, ethyl, phenyl, trifluoromethanesulfonyl, nonafluorobutanesulfonyl, p-toluenesulfonyl and methanesulfonyl, respectively. A more comprehensive list of the abbreviations utilized by organic chemists of ordinary skill in the art appears in the first issue of each volume of the Journal of Organic Chemistry; this list is typically presented in a table entitled Standard List of Abbreviations.

[0088] “Pituitary adenyl cyclase activator polypeptide” (PACAP) is used herein to refer to the neuropolypeptide that is a potent activator of cAMP-dependent signaling pathways. PACAP acts as a multifunctional peptide and is involved in such diverse processes as the regulation of hormonal secretion, energy metabolism, neuronal survival and is a regulator of glial glutamate transporters EAAT1 and EAAT2 (Figiel and Engele, J. Neurosci. 15: 3596-3605, 2000). PACAP belongs to the secretin/glucagon/vasoactive intestinal peptide (VIP) superfamily, and exists in two amidated forms as PACAP38 (38-amino acid residues) and PACAP27 (27-amino acid residues) derived from the same precursor. The primary structure of PACAP has been remarkably conserved throughout evolution among tunicata, ichthyopsida, amphibia and mammalia, and a PACAP-like neuropeptide has also been determined in Drosophila. Besides PACAP-38 and PACAP-27, a third agonist of the PACAP receptor is Maxadilian. Maxadilan is a potent vasodilator peptide isolated from salivary glands extracts of the hematophagous sand fly. Recently, it was demonstrated that maxadilan binds to PACAP receptor type 1 in mammals, although maxadilan has no significant amino acid sequence homology with PACAP (Moro and Lemer: Maxadilan, J. Biol. Chem. 272(2):966-70, 1997). Both PACAP and its receptors are mainly distributed in the nervous and endocrine systems showing pleiotropic functions with high potency. Thus, PACAP peptides, Maxadilan or peptide derivatives and analogs, peptide-like compounds and small-molecule agonists triggering the PACAP receptor can be used to increase glutamate transporter activity.

[0089] The terms “polycyclyl” or “polycyclic group” are art-recognized and refer to two or more rings (e.g., cycloalkyls, cycloalkenyls, cycloalkynyls, aryls and/or heterocyclyls) in which two or more carbons are common to two adjoining rings, e.g., the rings are “fused rings”. Rings that are joined through non-adjacent atoms are termed “bridged” rings. Each of the rings of the polycycle may be substituted with such substituents as described above, as for example, halogen, alkyl, aralkyl, alkenyl, alkynyl, cycloalkyl, hydroxyl, amino, nitro, sulfhydryl, imino, amido, phosphonate, phosphinate, carbonyl, carboxyl, silyl, ether, alkylthio, sulfonyl, ketone, aldehyde, ester, a heterocyclyl, an aromatic or heteroaromatic moiety, —CF₃, —CN, or the like.

[0090] “Plurality” is used herein to refer to two or more.

[0091] The term “prodrug” is art-recognized and is intended to encompass compounds which, under physiological conditions, are converted into the antibacterial agents of the present invention. A common method for making a prodrug is to select moieties which are hydrolyzed under physiological conditions to provide the desired compound. In other embodiments, the prodrug is converted by an enzymatic activity of the host animal.

[0092] The term “protecting group” is art-recognized and refers to temporary substituents that protect a potentially reactive functional group from undesired chemical transformations. Examples of such protecting groups include esters of carboxylic acids, silyl ethers of alcohols, and acetals and ketals of aldehydes and ketones, respectively. The field of protecting group chemistry has been reviewed by Greene and Wuts in Protective Groups in Organic Synthesis (2^(nd) ed., Wiley: New York, 1991).

[0093] The term “hydroxyl-protecting group” is art-recognized and refers to those groups intended to protect a hydrozyl group against undesirable reactions during synthetic procedures and includes, for example, benzyl or other suitable esters or ethers groups known in the art.

[0094] The term “carboxyl-protecting group” is art-recognized and refers to those groups intended to protect a carboxylic acid group, such as the C-terminus of an amino acid or peptide or an acidic or hydroxyl azepine ring substituent, against undesirable reactions during synthetic procedures and includes. Examples for protecting groups for carboxyl groups involve, for example, benzyl ester, cyclohexyl ester, 4-nitrobenzyl ester, t-butyl ester, 4-pyridylmethyl ester, and the like.

[0095] The term “amino-blocking group” is art-recognized and refers to a group which will prevent an amino group from participating in a reaction carried out on some other fuinctional group, but which can be removed from the amine when desired. Such groups are discussed by in Ch. 7 of Greene and Wuts, cited above, and by Barton, Protective Groups in Organic Chemistry ch. 2 (McOmie, ed., Plenum Press, New York, 1973). Examples of suitable groups include acyl protecting groups such as, to illustrate, formyl, dansyl, acetyl, benzoyl, trifluoroacetyl, succinyl, methoxysuccinyl, benzyl and substituted benzyl such as 3,4-dimethoxybenzyl, o-nitrobenzyl, and triphenylmethyl; those of the formula —COOR where R includes such groups as methyl, ethyl, propyl, isopropyl, 2,2,2-trichloroethyl, 1-methyl-1-phenylethyl, isobutyl, t-butyl, t-amyl, vinyl, allyl, phenyl, benzyl, p-nitrobenzyl, o-nitrobenzyl, and 2,4-dichlorobenzyl; acyl groups and substituted acyl such as formyl, acetyl, chloroacetyl, dichloroacetyl, trichloroacetyl, trifluoroacetyl, benzoyl, and p-methoxybenzoyl; and other groups such as methanesulfonyl, p-toluenesulfonyl, p-bromobenzenesulfonyl, p-nitrophenylethyl, and p-toluenesulfonyl-aminocarbonyl. Preferred amino-blocking groups are benzyl (—CH₂C₆H₅), acyl [C(O)R1] or SiR1₃ where R1 is C₁-C₄ alkyl, halomethyl, or 2-halo-substituted-(C₂-C₄ alkoxy), aromatic urethane protecting groups as, for example, carbonylbenzyloxy (Cbz); and aliphatic urethane protecting groups such as t-butyloxycarbonyl (Boc) or 9-fluorenylmethoxycarbonyl (FMOC).

[0096] The definition of each expression, e.g. lower alkyl, m, n, p and the like, when it occurs more than once in any structure, is intended to be independent of its definition elsewhere in the same structure.

[0097] The term “electron-withdrawing group” is art-recognized, and refers to the tendency of a substituent to attract valence electrons from neighboring atoms, i.e., the substituent is electronegative with respect to neighboring atoms. A quantification of the level of electron-withdrawing capability is given by the Hammett sigma (σ) constant. This well known constant is described in many references, for instance, March, Advanced Organic Chemistry 251-59 (McGraw Hill Book Company: New York, 1977). The Hammett constant values are generally negative for electron donating groups (σ(P)=−0.66 for NH₂) and positive for electron withdrawing groups (σ(P)=0.78 for a nitro group), σ(P) indicating para substitution. Exemplary electron-withdrawing groups include nitro, acyl, formyl, sulfonyl, trifluoromethyl, cyano, chloride, and the like. Exemplary electron-donating groups include amino, methoxy, and the like.

[0098] “RNA” is used herein to refer to the various species of ribonucleic acids, such as messenger RNA, mature RNA, polyadenylated RNA, unpolyadenylated RNA and RNA that contains introns and/or 5′ or 3′ untranslated regions. “Expressed RNA” is used herein to refer to RNA that is transcribed from genomic or mitochondrial DNA by a polymerase.

[0099] The term “regioisomers” is art-recognized and refers to compounds which have the same molecular formula but differ in the connectivity of the atoms. Accordingly, a “regioselective process” is one which favors the production of a particular regioisomer over others, e.g., the reaction produces a statistically significant increase in the yield of a certain regioisomer. The term “epimers” is art-recognized and refers to molecules with identical chemical constitution and containing more than one stereocenter, but which differ in configuration at only one of these stereocenters.

[0100] “Small molecule” is used herein to refer to a composition, which has a molecular weight of less than about 5 kD and most preferably less than about 4 kD. Small molecules can be nucleic acids, peptides, polypeptides, peptidomimetics, carbohydrates, lipids or other organic (carbon-containing) or inorganic molecules. Many pharmaceutical companies and suppliers have extensive libraries of chemical and/or biological mixtures, often fingal, bacterial, or algal extracts, which can be screened with any of the assays of the invention to identify compounds that modulate a bioactivity, such as a desired behavior or cognitive function.

[0101] The term “stereoisomers” is art-recognized and refers to compounds which have identical chemical constitution, but differ with regard to the arrangement of the atoms or groups in space. In particular, “enantiomers” refer to two stereoisomers of a compound which are non-superimposable mirror images of one another. “Diastereomers”, on the other hand, refers to stereoisomers with two or more centers of dissymmetry and whose molecules are not mirror images of one another.

[0102] Furthermore, a “stereoselective process” is one which produces a particular stereoisomer of a reaction product in preference to other possible stereoisomers of that product. An “enantioselective process” is one which favors production of one of the two possible enantiomers of a reaction product.

[0103] The term “structure-activity relationship” or “(SAR)” is art-recognized and refers to the way in which altering the molecular structure of a drug or other compound alters its interaction with a receptor, enzyme, nucleic acid or other target and the like.

[0104] “Subject” is used herein to refer to a mammal, e.g., a human, non-human primate, ovine, bovine, porcine, equine, feline, murine or canine. Preferably, the subject is a human. A subject or mammal “in need of” treatment according to the present invention has impaired cognitive function.

[0105] It will be understood that “substitution” or “substituted with” includes the implicit proviso that such substitution is in accordance with permitted valence of the substituted atom and the substituent, and that the substitution results in a stable compound, e.g., which does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, or other reaction.

[0106] The term “substituted” is also contemplated to include all permissible substituents of organic compounds. In a broad aspect, the permissible substituents include acyclic and cyclic, branched and unbranched, carbocyclic and heterocyclic, aromatic and nonaromatic substituents of organic compounds. Illustrative substituents include, for example, those described herein above. The permissible substituents may be one or more and the same or different for appropriate organic compounds. For purposes of this invention, the heteroatoms such as nitrogen may have hydrogen substituents and/or any permissible substituents of organic compounds described herein which satisfy the valences of the heteroatoms. This invention is not intended to be limited in any manner by the permissible substituents of organic compounds.

[0107] The term “sulfonate” is art recognized and refers to a moiety that may be represented by the general formula:

[0108] in which R57 is an electron pair, hydrogen, alkyl, cycloalkyl, or aryl.

[0109] The term “sulfate” is art recognized and includes a moiety that may be represented by the general formula:

[0110] in which R57 is as defined above.

[0111] The term “sulfonamido” is art recognized and includes a moiety that may be represented by the general formula:

[0112] in which R50 and R56 are as defined above.

[0113] The term “sulfamoyl” is art-recognized and refers to a moiety that may be represented by the general formula:

[0114] in which R50 and R51 are as defined above.

[0115] The term “sulfonyl” is art-recognized and refers to a moiety that may be represented by the general formula:

[0116] in which R58 is one of the following: hydrogen, alkyl, alkenyl, alkynyl, cycloalkyl, heterocyclyl, aryl or heteroaryl.

[0117] The term “sulfoxido” is art-recognized and refers to a moiety that may be represented by the general formula:

[0118] in which R58 is defined above.

[0119] The term “synthetic” is art-recognized and refers to production by in vitro chemical or enzymatic synthesis.

[0120] “Test population” is used herein to refer to subjects having a desired behavior or cognitive function. Members of the test population may include young, middle-aged and aged subjects.

[0121] “Therapeutic agent” is used herein to refer to a chemical compound or composition capable of inducing a desired therapeutic or prophylactic effect when properly administered to a subject in need thereof. The “therapeutic agent” may be any chemical moiety or biologic that is a biologically, physiologically, or pharmacologically active substance that acts locally or systemically in a subject in need thereof. Examples of chemical therapeutic agents, also referred to as “drugs”, are described in well-known literature references such as the Merck Index, the Physicians Desk Reference, and The Pharmacological Basis of Therapeutics, and they include, without limitation, medicaments; vitamins; mineral supplements; substances used for the treatment, prevention, diagnosis, cure or mitigation of a disease or illness; substances which affect the structure or function of the body; or pro-drugs, which become biologically active or more active after they have been placed in a physiological environment. Antibiotic agents and Fab I/Fab K inhibitors are examples of therapeutic agents. Examples of biologic therapeutic agents include viral vectors that contain genes and deliver the gene to the subject.

[0122] Therapeutic agents induce a local or systemic effect in animals, particularly mammals, and more particularly humans caused by a pharmacologically active substance. Thus, a therapeutic agent may be used for the diagnosis, cure, mitigation, treatment or prevention of deleterious condition or in the enhancement of desirable physical or mental development and/or conditions in an animal or human.

[0123] To be effective, a therapeutic agent are delivered in an amount or concentration that produces some desired local or systemic effect at a reasonable benefit/risk ratio applicable to any treatment. The effective amount of such therapeutic agent will vary depending upon the subject and condition being treated, the weight and age of the subject, the severity of the disease condition, the manner of administration and the like, which can readily be determined by one of ordinary skill in the art. For example, certain compositions of the present invention may be administered in a sufficient amount to produce an effect at a reasonable benefit/risk ratio applicable to such treatment. In the context of impaired cognitive function the presence of degree of therapeutic effect can be assessed using standard behavioral or other tests known in the art for assessing cognitive function.

[0124] The term “trans” is art-recognized and refers to the arrangement of two atoms or groups around a double bond such that the atoms or groups are on the opposite sides of a double bond. Trans configurations are often labeled as (E) configurations.

[0125] “Treating” impaired cognitive function in a subject or “treating” a subject having impaired cognitive function are used herein to refer to providing the subject with a therapeutic agent by any appropriate means, e.g., the administration of a drug, such that at least one symptom of the impaired cognitive function is stabilized or decreased. Treating impaired cognitive function can be preventing the impairment, delaying progression of the impairment or improving the impairment (lessening disease severity) or curing the impairment.

[0126] “Vector” is used herein to refer to compositions that may be used to introduce DNA or RNA into tissue. Methods which are well known to those skilled in the art can be used to construct expression vectors containing a nucleic acid encoding the protein of interest linked to appropriate transcriptional/translational control signals. See, for example, the techniques described in Sambrook & Russell, Molecular Cloning, A Laboratory Manual (3^(rd) Edition), Cold Spring Harbor Laboratory, N.Y. (2001) and Ausebel et al. Current Protocols in Molecular Biology, Greene Publishing Associates & Wiley Interscience, N.Y (1989).

[0127] Suitable methods for transferring vectors or plasmids into cells include lipid/DNA complexes, such as those described in U.S. Pat. Nos. 5,578,475; 5,627,175; 5,705,308; 5,744,335; 5,976,567; 6,020,202; and 6,051,429. Suitable reagents include lipofectamine, a 3:1 (w/w) liposome formulation of the poly-cationic lipid 2,3-dioleyloxy-N-[2(sperminecarbox-amido)ethyl]-N,N-dimethyl-1-propanaminium trifluoroacetate (DOSPA) (Chemical Abstracts Registry name: N-[2-(2,5-bis[(3-aminopropyl)amino]-1-oxpentyl)amino)ethyl]-N,N-dimethyl-2,3-bis(9-octadecenyloxy)-1-propanamin-trifluoroacetate), and the neutral lipid dioleoyl phosphatidylethanolamine (DOPE) in membrane filtered water. Exemplary is the formulation Lipofectamine 2000™ (available from Invitrogen (formerly Gibco/Life Technologies) # 11668019). Other reagents include: FUGENE™ 6 Transfection Reagent (a blend of lipids in non-liposomal form and other compounds in 80% ethanol, obtainable from Roche Diagnostics Corp. # 1814443); and LipoTAXI™ transfection reagent (a lipid formulation from Invitrogen Corp., produce the desired biologically active protein. #204110). Transfection of cells can be performed by electroporation, e.g., as described in Roach and McNeish (Methods in Mol. Biol. 185:1 (2002)). Suitable viral vector systems for producing cells with stable genetic alterations may be based on adenoviruses, lentiviruses, retroviruses, adeno-associated viruses (AAV) and other viruses, and may be prepared using commercially available virus components. Vectors can be introduced into neural cells and tissues by art-known methods, including injection (e.g., in to a specified region of the brain), by use of a shunt to the ventricular space or cerebrospinal fluid and other mechanical means.

[0128] “Young” refers to adolescents and normal adult mammals at about the age of sexual maturity and when the hippocampus has just fuilly mature. In the case of rats, a “young” rat would be 6-9 months of age. In the case of humans, a “young” human would be 10-20 years of age.

6.2 Introduction: Combination Studies of Behavioral and Genetic Assessments of Cognitive Function

[0129] Behavioral assessments of cognitive function with the Morris Water Maze and Radial Arm Maze have been useful in identifying age-related changes in cognitive function. Upon using these behavioral assessments as a method for phenotyping animals on the basis of their cognitive function, one may combine behavioral assessments with genetic and physiological measurements of cognitive function to detect differences in the effects of aging on the brain.

[0130] The use of gene expression arrays offers the potential to simultaneously analyze up to thousands of expressed genes in order to gain a genetic template of age- and behavior-associated changes in the brain. Such approaches also offer some challenges. First, our rat model, like the aging human population comprises a genetically outbred population which can add individual variability as a confounding factor in gene expression profiling. Second, using traditional quantitative methods to assess levels of specific mRNAs in hippocampus, we have found that age and behavior related changes in gene expression are often relatively small, smaller than the two fold differences in levels of gene expression which have been reported as the limits of discriminating power in existing Genechipg or microarray approaches. For example, work by Landfield and colleagues was limited by not being able to detect less than two-fold changes in gene expression (WO 03/025122 A2).

[0131] Herein we describe strategies which overcome such challenges, demonstrating reliable detection of small changes in expression of genes which we have shown by traditional methods to differ between aged and young rats. Analyses of a wider range of genes indicate that this method reproducibly reveals a substantial number of genes which show changes in expression within the hippocampus that are associated with behavioral status in aged rats.

[0132] The identification of genes associated with cognitive impairment allows one for the first time to determine whether a candidate compound can modulate expression of genes associated with normal cognitive function. Compounds that modulate expression of such genes so as to more closely approximate the level of expression thereof in a mammal, e.g., a human, having a desired cognitive function are expected to restore or improve cognitive function when used as therapeutic agents. Using this approach, we report the discovery of genes assoicated with the preservation of cognitive function in aged mammals. Without being limited by speculation, we believe that such preservation represents an active biological process that can be triggered or induced by treatment with appropriate therapeutic agents. Indeed, we report herein that one such agent is ceftriaxone, a third generation cephalosporin. Other such agents are valproic acid and MS-153. Additional such therapeutic agents can be identified and optimized using the screening methods described below. The experiment approaches which led to the invention and the the invention itself as well as techniques for practice of the invention are set forth in the following sections.

[0133] 6.2.1 Isolating RNA

[0134] When isolating RNA from tissue samples or cells from individuals, it may be important to prevent any further changes in gene expression after the tissue or cells has been removed from the subject. Changes in expression levels are known to occur rapidly following perturbations, e.g., heat shock or activation with lipopolysaccharide (LPS) or other reagents. In addition, the RNA in the tissue and cells may quickly become degraded. Accordingly, in a preferred embodiment, the tissue or cells obtained from a subject is snap frozen as soon as possible.

[0135] RNA can be extracted from the tissue sample by a variety of methods, e.g., those described in the Examples or guanidium thiocyanate lysis followed by CsCl centrifugation (Chirgwin et al., 1979, Biochemistry 18:5294-5299). RNA from frozen tissue can be isolated by homogenizing the tissue in a phenol/guanidinium thiocyanate mixture (available from Invitrogen) and extracted with chloroform followed by precipitation with isopropanol. The RNA pellet can then be resuspended and further purified over RNeasy columns (Qiagen). All RNA may be stored at −80° in the absence of RNase inhibitors and the integrity assessed by agarose gel electrophoreisis. RNA from single cells can be obtained as described in methods for preparing cDNA libraries from single cells, such as those described in Dulac, C. (1998) Curr. Top. Dev. Biol. 36, 245 and Jena et al. (1996) J. Immunol. Methods 190:199. Care to avoid RNA degradation must be taken, e.g., by inclusion of RNAse inhibitor.

[0136] The RNA sample can then be enriched in particular species. In one embodiment, poly(A)+RNA is isolated from the RNA sample. In general, such purification takes advantage of the poly-A tails on mRNA. In particular and as noted above, poly-T oligonucleotides may be immobilized within on a solid support to serve as affinity ligands for mRNA. Kits for this purpose are commercially available, e.g., the MessageMaker kit (Invitrogen #10298016).

[0137] In a preferred embodiment, the RNA population is enriched in sequences of interest, such as those of genes involved in cognitive function. Enrichment can be undertaken, e.g., by primer-specific cDNA synthesis, or multiple rounds of linear amplification based on cDNA synthesis and template-directed in vitro transcription (see, e.g., Wang et al. (1989) PNAS 86, 9717; Dulac et al., supra, and Jena et al., supra).

[0138] The population of RNA, enriched or not in particular species or sequences, can further be amplified. Such amplification is particularly important when using RNA from a single or a few cells. A variety of amplification methods are suitable for use in the methods of the invention, including, e.g., PCR; ligase chain reaction (LCR) (see, e.g., Wu and Wallace, Genomics 4, 560 (1989), Landegren et al., Science 241, 1077 (1988)); self-sustained sequence replication (SSR) (see, e.g., Guatelli et al., Proc. Nat. Acad. Sci. USA, 87, 1874 (1990)); nucleic acid based sequence amplification (NASBA) and transcription amplification (see, e.g., Kwoh et al., Proc. Natl. Acad. Sci. USA 86, 1173 (1989)). For PCR technology, see, e.g., PCR Technology: Principles and Applications for DNA Amplification (ed. H. A. Erlich, Freeman Press, N.Y., N.Y., 1992); PCR Protocols: A Guide to Methods and applications (eds. Innis, et al., Academic Press, San Diego, Calif., 1990); Mattila et al., Nucleic Acids Res. 19, 4967 (1991); Eckert et al., PCR Methods and Applications 1, 17 (1991); PCR (eds. McPherson et al., IRL Press, Oxford); and U.S. Pat. No. 4,683,202. Methods of amplification are described, e.g., in Ohyama et al. (2000) BioTechniques 29:530; Luo et al. (1999) Nat. Med. 5, 117; Hegde et al. (2000) BioTechniques 29:548; Kacharmina et al. (1999) Meth. Enzymol. 303:3; Livesey et al. (2000) Curr. Biol. 10:301; Spirin et al. (1999) Invest. Ophtalmol. Vis. Sci. 40:3108; and Sakai et al. (2000) Anal. Biochem. 287:32. RNA amplification and cDNA synthesis can also be conducted in cells in situ (see, e.g., Eberwine et al. (1992) PNAS 89:3010). “Quantitative PCR” refers to using a PCR protocol that allows one to determine the amount of reaction product or number of reaction products in a sample.

[0139] One of skill in the art will appreciate that whatever amplification method is used, if a quantitative result is desired, care must be taken to use a method that maintains or controls for the relative frequencies of the amplified nucleic acids to achieve quantitative amplification. Methods of “quantitative” amplification are well known to those of skill in the art. For example, quantitative PCR involves simultaneously co-amplifying a known quantity of a control sequence using the same primers. This provides an internal standard that may be used to calibrate the PCR reaction. A high density array may then include probes specific to the internal standard for quantification of the amplified nucleic acid.

[0140] One preferred internal standard is a synthetic AW106 cRNA. The AW106 cRNA is combined with RNA isolated from the sample according to standard techniques known to those of skilled in the art. The RNA is then reverse transcribed using a reverse transcriptase to provide copy DNA. The cDNA sequences are then amplified (e.g., by PCR) using labeled primers. The amplification products are separated, typically by electrophoresis, and the amount of radioactivity (proportional to the amount of amplified product) is determined. The amount of MRNA in the sample is then calculated by comparison with the signal produced by the known AW106 RNA standard. Detailed protocols for quantitative PCR are provided in PCR Protocols, A Guide to Methods and Applications, Innis et al., Academic Press, Inc. N.Y., (1990).

[0141] In a preferred embodiment, a sample MRNA is reverse transcribed with a reverse transcriptase and a primer consisting of oligo(dT) and a sequence encoding the phage T7 promoter to provide single stranded DNA template. The second DNA strand is polymerized using a DNA polymerase. After synthesis of double-stranded cDNA, T7 RNA polymerase is added and RNA is transcribed from the cDNA template. Successive rounds of transcription from each single cDNA template results in amplified RNA. Methods of in vitro polymerization are well known to those of skill in the art (see, e.g., Sambrook & Russell, (supra) and this particular method is described in detail by Van Gelder, et al., Proc. Natl. Acad. Sci. USA, 87: 1663-1667 (1990) who demonstrate that in vitro amplification according to this method preserves the relative frequencies of the various RNA transcripts). Moreover, Eberwine et al. Proc. Natl. Acad. Sci. USA, 89: 3010-3014 provide a protocol that uses two rounds of amplification via in vitro transcription to achieve greater than 106 fold amplification of the original starting material, thereby permitting expression monitoring even where biological samples are limited.

[0142] It will be appreciated by one of skill in the art that the direct transcription method described above provides an antisense (aRNA) pool. Where antisense RNA is used as the target nucleic acid, the oligonucleotide probes provided in the array are chosen to be complementary to subsequences of the antisense nucleic acids. Conversely, where the target nucleic acid pool is a pool of sense nucleic acids, the oligonucleotide probes are selected to be complementary to subsequences of the sense nucleic acids. Finally, where the nucleic acid pool is double stranded, the probes may be of either sense as the target nucleic acids include both sense and antisense strands.

[0143] 6.2.2 Analyzing RNA

[0144] In certain embodiments, it is sufficient to determine the expression of one or only a few genes, as opposed to hundreds or thousands of genes. Although microarrays can be used in these embodiments, various other methods of detection of gene expression are available. This section describes a few exemplary methods for detecting and quantifying mRNA or polypeptide encoded thereby. Where the first step of the methods includes isolation of mRNA from cells, this step can be conducted as described above. Labeling of one or more nucleic acids can be performed as described below.

[0145] In one embodiment, mRNA obtained from a sample is reverse transcribed into a first cDNA strand and subjected to PCR, e.g., RT-PCR. House keeping genes, or other genes whose expression does not vary can be used as internal controls and controls across experiments. Following the PCR reaction, the amplified products can be separated by electrophoresis and detected. By using quantitative PCR, the level of amplified product will correlate with the level of RNA that was present in the sample. The amplified samples can also be separated on a agarose or polyacrylamide gel, transferred onto a filter, and the filter hybridized with a probe specific for the gene of interest. Numerous samples can be analyzed simultaneously by conducting parallel PCR amplification, e.g., by multiplex PCR.

[0146] A quantitative PCR technique that can be used is based on the use of TaqMan™ probes. Specific sequence detection occurs by amplification of target sequences in the PE Applied Biosystems 7700 Sequence Detection System in the presence of an oligonucleotide probe labeled at the 5′ and 3′ ends with a reporter and quencher fluorescent dye, respectively (FQ probe), which anneals between the two PCR primers. Only specific product will be detected when the probe is bound between the primers. As PCR amplification proceeds, the 5′-nuclease activity of Taq polymerase initially cleaves the reporter dye from the probe. The signal generated when the reporter dye is physically separated from the quencher dye is detected by measuring the signal with an attached CCD camera. One can also use an intercalating dye such as sybr green. Each signal generated equals one probe cleaved which corresponds to amplification of one target strand. PCR reactions may be set up using the PE Applied Biosystem TaqMan PCR Core Reagent Kit according to the instructions supplied. This technique is further described, e.g., in U.S. Pat. No. 6,326,462. Alternatively, probes can be obtained from Applied Biosystems and Qiagen for use with Invitrogen's Platinum quantitative PCR kit and the Rotorgene 3000.

[0147] In another embodiment, mRNA levels is determined by dotblot analysis and related methods (see, e.g., G. A. Beltz et al., in Methods in Enzymology, Vol. 100, Part B, R. Wu, L. Grossmam, K. Moldave, Eds., Academic Press, New York, Chapter 19, pp. 266-308, 1985). In one embodiment, a specified amount of RNA extracted from cells is blotted (i.e., non-covalently bound) onto a filter, and the filter is hybridized with a probe of the gene of interest. Numerous RNA samples can be analyzed simultaneously, since a blot can comprise multiple spots of RNA. Hybridization is detected using a method that depends on the type of label of the probe. In another dotblot method, one or more probes of one or more genes which are up- or down-regulated in cognitive impairment are attached to a membrane, and the membrane is incubated with labeled nucleic acids obtained from and optionally derived from RNA of a cell or tissue of a subject. Such a dot blot is essentially an array comprising fewer probes than a microarray.

[0148] “Dot blot” hybridization gained wide-spread use, and many versions were developed (see, e.g., M. L. M. Anderson and B. D. Young, in Nucleic Acid Hybridization-A Practical Approach, B. D. Hames and S. J. Higgins, Eds., IRL Press, Washington D.C., Chapter 4, pp. 73-111, 1985).

[0149] Another format, the so-called “sandwich” hybridization, involves covalently attaching oligonucleotide probes to a solid support and using them to capture and detect multiple nucleic acid targets (see, e.g., M. Ranki et al., Gene, 21, pp. 77-85, 1983; A. M. Palva, T. M. Ranki, and H. E. Soderlund, in UK Patent Application GB 2156074A, Oct. 2, 1985; T. M. Ranki and H. E. Soderlund in U.S. Pat. No. 4,563,419, Jan. 7, 1986; A. D. B. Malcolm and J. A. Langdale, in PCT WO 86/03782, Jul. 3, 1986; Y. Stabinsky, in U.S. Pat. No. 4,751,177, Jan. 14, 1988; T. H. Adams et al., in PCT WO 90/01564, Feb. 22, 1990; R. B. Wallace et al. 6 Nucleic Acid Res. 11, p. 3543, 1979; and B. J. Connor et al., 80 Proc. Natl. Acad. Sci. USA pp. 278-282, 1983). Multiplex versions of these formats are called “reverse dot blots.”

[0150] mRNA levels can also be determined by Northern blots. Specific amounts of RNA are separated by gel electrophoresis and transferred onto a filter which are then hybridized with a probe corresponding to the gene of interest. This method, although more burdensome when numerous samples and genes are to be analyzed provides the advantage of being very accurate.

[0151] A preferred method for high throughput analysis of gene expression is the serial analysis of gene expression (“SAGE”) technique, first described in Velculescu et al. (1995) Science 270, 484-487. Among the advantages of SAGE is that it has the potential to provide detection of all genes expressed in a given cell type, provides quantitative information about the relative expression of such genes, permits ready comparison of gene expression of genes in two cells, and yields sequence information that can be used to identify the detected genes. Thus far, SAGE methodology has proved itself to reliably detect expression of regulated and nonregulated genes in a variety of cell types (Velculescu et al. (1997) Cell 88, 243-251; Zhang et al. (1997) Science 276, 1268-1272 and Velculescu et al. (1999) Nat. Genet. 23, 387-388).

[0152] Techniques for producing and probing nucleic acids are further described, for example, in Sambrook & Russell, (supra).

[0153] Alternatively, the level of expression of one or more genes which are up- or down-regulated in cognitive impairment is determined by in situ hybridization histochemistry. In one embodiment, a tissue sample is obtained from a subject, a thin section is prepared, and in situ hybridization is performed according to methods known in the art, to determine the level of expression of the genes of interest.

[0154] The above methods may be used to assess an increase expression of an endogenous gene that may be activated by introducing into the mammal a new transcriptional unit, or gene activation construct, that comprises an exogenous regulatory sequence, an exogenous exon, and a splice site, operably linked to the second exon of an endogenous gene, wherein the cell comprises the exogenous exon in addition to exons present in the endogenous gene (see, for example, U.S. Pat. Nos.: 5,641.670; 5,773,746; 5,733,761; 5,968,502; 6,702,989 and 6,565,844).

[0155] In other methods, the level of expression of a gene is detected by measuring the level of protein encoded by the gene. This can be done, e.g., by immunoprecipitation, ELISA, or immunohistochemistry using an agent, e.g., an antibody, that specifically detects the protein encoded by the gene. Other techniques include Western blot analysis. Immunoassays are commonly used to quantitate the levels of proteins in cell samples, and many other immunoassay techniques are known in the art. The invention is not limited to a particular assay procedure, and therefore is intended to include both homogeneous and heterogeneous procedures. Exemplary immunoassays which can be conducted according to the invention include fluorescence polarization immunoassay (FPIA), fluorescence immunoassay (FIA), enzyme immunoassay (EIA), nephelometric inhibition immunoassay (NIA), enzyme linked immunosorbent assay (ELISA), and radioimmunoassay (RIA). An indicator moiety, or label group, can be attached to the subject antibodies and is selected so as to meet the needs of various uses of the method which are often dictated by the availability of assay equipment and compatible immunoassay procedures. General techniques to be used in performing the various immunoassays noted above are known to those of ordinary skill in the art.

[0156] In the case of polypeptides which are secreted from cells, the level of expression of these polypeptides can be measured in biological fluids.

[0157] In some embodiments, mRNA levels are detected and/or measured by microarray analysis as described in detail in the following sections.

6.3 Introduction: Microarray

[0158] Generally, determining expression profiles with arrays involves the following steps: (a) obtaining a mRNA sample from a subject and preparing labeled nucleic acids therefrom (the “target nucleic acids” or “targets”); (b) contacting the target nucleic acids with the array under conditions sufficient for target nucleic acids to bind with corresponding probes on the array, e.g. by hybridization or specific binding; (c) optionally removing unbound targets from the array; (d) detecting bound targets, and (e) analyzing the results. As used herein, “nucleic acid probes” or “probes” are nucleic acids attached to the array, whereas “target nucleic acids” are nucleic acids that are hybridized to the array. Each of these steps is described in more detail below.

[0159] 6.3.1 Labeling the Nucleic Acid for the Microarray Analysis

[0160] Generally, the target molecules will be labeled to permit detection of hybridization of target molecules to a microarray. By “labeled” is meant that the probe comprises a member of a signal producing system and is thus detectable, either directly or through combined action with one or more additional members of a signal producing system. Examples of directly detectable labels include isotopic and fluorescent moieties incorporated into, usually covalently bonded to, a moiety of the probe, such as a nucleotide monomeric unit, e.g. dNMP of the primer, or a photoactive or chemically active derivative of a detectable label which can be bound to a functional moiety of the probe molecule.

[0161] Nucleic acids can be labeled after or during enrichment and/or amplification of RNAs. For example, labeled cDNA can be prepared from MRNA by oligo dT-primed or random-primed reverse transcription, both of which are well known in the art (see, e.g., Klug and Berger, 1987, Methods Enzymol. 152:316-325). Reverse transcription may be carried out in the presence of a dNTP conjugated to a detectable label, most preferably a fluorescently labeled dNTP. Alternatively, isolated MRNA can be converted to labeled antisense RNA synthesized by in vitro transcription of double-stranded cDNA in the presence of labeled dNTPs (Lockhart et al., Nature Biotech. 14:1675, 1996). In alternative embodiments, the cDNA or RNA probe can be synthesized in the absence of detectable label and may be labeled subsequently, e.g., by incorporating biotinylated dNTPs or rNTP, or some similar means (e.g., photo-cross-linking a psoralen derivative of biotin to RNAs), followed by addition of labeled streptavidin (e.g., phycoerythrin-conjugated streptavidin) or the equivalent.

[0162] In one embodiment, labeled cDNA is synthesized by incubating a mixture containing RNA and 0.5 mM dGTP, dATP and dCTP plus 0.1 mM dTTP plus fluorescent deoxyribonucleotides (e.g., 0.1 mM Rhodamine 110 UTP (Perken Elmer Cetus) or 0.1 mM Cy3 dUTP (Amersham)) with reverse transcriptase (e.g., SuperScript™II, LTI Inc.) at 42° C. for 60 min.

[0163] Fluorescent moieties or labels of interest include coumarin and its derivatives, e.g. 7-amino-4-methylcoumarin, aminocoumarin, bodipy dyes, such as Bodipy FL, cascade blue, fluorescein and its derivatives, e.g. fluorescein isothiocyanate, Oregon green, rhodamine dyes, e.g. Texas red, tetramethylrhodamine, eosins and erythrosins, cyanine dyes, e.g. Cy2, Cy3, Cy3.5, Cy5, Cy5.5, Cy7, FluorX, macrocyclic chelates of lanthanide ions, e.g. quantum dye™, fluorescent energy transfer dyes, such as thiazole orange-ethidium heterodimer, TOTAB, dansyl, etc. Individual fluorescent compounds which have functionalities for linking to an element desirably detected in an apparatus or assay of the invention, or which can be modified to incorporate such functionalities include, e.g., dansyl chloride; fluoresceins such as 3,6-dihydroxy-9-phenylxanthydrol; rhodamineisothiocyanate; N-phenyl 1-amino-8-sulfonatonaphthalene; N-phenyl 2-amino-6-sulfonatonaphthalene; 4-acetamido-4-isothiocyanato-stilbene-2,2′-disulfonic acid; pyrene-3-sulfonic acid; 2-toluidinonaphthalene-6-sulfonate; N-phenyl-N-methyl-2-aminoaphthalene-6-sulfonate; ethidium bromide; stebrine; auromine-0,2-(9′-anthroyl)palmitate; dansyl phosphatidylethanolamine; N,N′-dioctadecyl oxacarbocyanine: N,N′-dihexyl oxacarbocyanine; merocyanine, 4-(3′-pyrenyl)stearate; d-3-aminodesoxy-equilenin; 12-(9′-anthroyl)stearate; 2-methylanthracene; 9-vinylanthracene; 2,2′(vinylene-p-phenylene)bisbenzoxazole; p-bis(2-methyl-5-phenyl-oxazolyl))benzene; 6-dimethylamino-1,2-benzophenazin; retinol; bis(3′-aminopyridinium) 1,10-decandiyl diiodide; sulfonaphthylhydrazone of hellibrienin; chlorotetracycline; N-(7-dimethylamino-4-methyl-2-oxo-3-chromenyl)maleimide; N-(p-(2benzimidazolyl)-phenyl)maleimide; N-(4-fluoranthyl)maleimide; bis(homovanillic acid); resazarin; 4-chloro-7-nitro-2,1,3-benzooxadiazole; merocyanine 540; resorufin; rose bengal; and 2,4-diphenyl-3(2H)-furanone. (see, e.g., Kricka, 1992, Nonisotopic DNA Probe Techniques, Academic Press San Diego, Calif.). Many fluorescent tags are commercially available from SIGMA-Aldrich, Amersham Biosciences, Molecular Probes, Pfizer (formerly Pharmacia), BD Biosciences (formerly CLONTECH), ChemGenes Corp., Glen Research Corp., Invitrogen, Fluka Chemica-Biochemika Analytika (Fluka Chemie AG, Buchs, Switzerland), and Applied Biosystems (Foster City, Calif.) as well as other commercial sources known to one of skill.

[0164] Chemiluminescent labels include luciferin and 2,3-dihydrophthalazinediones, e.g., luminol.

[0165] Isotopic moieties or labels of interest include ³²P, ³³P, ³⁵S, ¹²⁵I, ²H, ¹⁴C, and the like (see Zhao et al., Gene 156:207, 1995; Pietu et al., Genome Res. 6:492, 1996).

[0166] Labels may also be members of a signal producing system that act in concert with one or more additional members of the same system to provide a detectable signal. Illustrative of such labels are members of a specific binding pair, such as ligands, e.g. biotin, fluorescein, digoxigenin, antigen, polyvalent cations, chelator groups and the like, where the members specifically bind to additional members of the signal producing system, where the additional members provide a detectable signal either directly or indirectly, e.g. antibody conjugated to a fluorescent moiety or an enzymatic moiety capable of converting a substrate to a chromogenic product, e.g. alkaline phosphatase conjugate antibody and the like.

[0167] Additional labels of interest include those that provide for signal only when the probe with which they are associated is specifically bound to a target molecule, where such labels include: “molecular beacons” as described in Tyagi & Kramer, Nature Biotechnology 14:303, 1996 and EP 0 070 685 B1. Other labels of interest include those described in U.S. Pat. No. 5,563,037; WO 97/17471 and WO 97/17076.

[0168] In some cases, hybridized target nucleic acids may be labeled following hybridization. For example, where biotin labeled dNTPs are used in, e.g., amplification or transcription, streptavidin linked reporter groups may be used to label hybridized complexes.

[0169] In other embodiments, the target nucleic acid is not labeled. In this case, hybridization can be determined, e.g., by plasmon resonance, as described, e.g., in Thiel et al., Anal. Chem. 69:4948, 1997.

[0170] In one embodiment, a plurality (e.g., 2, 3, 4, 5 or more) of sets of target nucleic acids are labeled and used in one hybridization reaction (“multiplex” analysis). For example, one set of nucleic acids may correspond to RNA from one cell or tissue sample and another set of nucleic acids may correspond to RNA from another cell or tissue sample. The plurality of sets of nucleic acids can be labeled with different labels, e.g., different fluorescent labels which have distinct emission spectra so that they can be distinguished. The sets can then be mixed and hybridized simultaneously to one microarray.

[0171] The use of a two-color fluorescence labeling and detection scheme to define alterations in gene expression has been described, e.g., in Shena et al., Science 270:467-470, 1995. An advantage of using cDNA labeled with two different fluorophores is that a direct and internally controlled comparison of the mRNA levels corresponding to each arrayed gene in two cell states can be made, and variations due to minor differences in experimental conditions (e.g, hybridization conditions) will not affect subsequent analyses.

[0172] Examples of distinguishable labels for use when hybridizing a plurality of target nucleic acids to one array are well known in the art and include: two or more different emission wavelength fluorescent dyes, like Cy3 and Cy5, combination of fluorescent proteins and dyes, like phicoerythrin and Cy5, two or more isotopes with different energy of emission, like ³²P and ³³P, gold or silver particles with different scattering spectra, labels which generate signals under different treatment conditions, like temperature, pH, treatment by additional chemical agents, etc., or generate signals at different time points after treatment. Using one or more enzymes for signal generation allows for the use of an even greater variety of distinguishable labels, based on different substrate specificity of enzymes (alkaline phosphatase/peroxidase).

[0173] Further, it is preferable in order to reduce experimental error to reverse the fluorescent labels in two-color differential hybridization experiments to reduce biases peculiar to individual genes or array spot locations. In other words, it is preferable to first measure gene expression with one labeling (e.g., labeling nucleic acid from a first cell with a first fluorochrome and nucleic acid from a second cell with a second fluorochrome) of the mRNA from the two cells being measured, and then to measure gene expression from the two cells with reversed labeling (e.g., labeling nucleic acid from the first cell with the second fluorochrome and nucleic acid from the second cell with the first fluorochrome). Multiple measurements over exposure levels and perturbation control parameter levels provide additional experimental error control.

[0174] The quality of labeled nucleic acids can be evaluated prior to hybridization to an array. For example, a sample of the labeled nucleic acids can be hybridized to probes derived from the 5′, middle and 3′ portions of genes known to be or suspected to be present in the nucleic acid sample. This will be indicative as to whether the labeled nucleic acids are full length nucleic acids or whether they are degraded. In one embodiment, the GeneChip® Test3 Array from Affymetrix (Santa Clara, Calif.) can be used for that purpose. This array contains probes representing a subset of characterized genes from several organisms including mammals. Thus, the quality of a labeled nucleic acid sample can be determined by hybridization of a fraction of the sample to an array, such as the GeneChip® Test3 Array from Affymetrix (Santa Clara, Calif.).

[0175] 6.3.2 Microarray Analysis

[0176] Preferred arrays, e.g., microarrays, for use according to the invention include one or more probes of genes which are candidate genes for their involvement in cognitive function. Exemplary arrays include one or more genes of interest to studying cognitive function such as those genes found on the GeneChip® Rat Expression Set 230 or GeneChip® Rat Neurobiology U34 Array, which contains over 1,200 sequences relevant to the study of neurobiology (including genes for kinases, cell surface). Additionally, one could use the GeneChip® HuSNP™ Array to survey the entire human genome by simultaneously tracking nearly 1,500 genetic variations, known as single nucleotide polymorphisms (SNPs), dispersed throughout the genome. SNPs are excellent markers for genomic searches because they are simple, abundant, widespread, and account for most of the genetic variability across human populations. Using high-throughput techniques, such as GeneChip® arrays, SNPs can be more easily tracked than traditional markers, such as microsatellite sequences.

[0177] The array may comprise probes corresponding to at least 10, preferably at least 20, at least 50, at least 100 or at least 1000 genes. The array may comprise probes corresponding to about 10%, 20%, 50%, 70%, 90% or 95% of the genes listed in FIG. 3 or other genes available on a microarray. The array may comprise probes corresponding to about 10%, 20%, 50%, 70%, 90% or 95% of the genes listed in FIG. 3 or other gene whose expression is at least 2 fold, preferably at least 3 fold, more preferably at least 4 fold, 5 fold, 7 fold and most preferably at least about 10 fold higher in cells. One exemplary preferred array that can be used is the array used and described in the Examples.

[0178] There can be one or more than one probe corresponding to each gene on a microarray. For example, a microarray may contain from 2 to 20 probes corresponding to one gene and preferably about 5 to 10. The probes may correspond to the full length RNA sequence or complement thereof of genes characteristic of candidate disease genes., or they may correspond to a portion thereof, which portion is of sufficient length for permitting specific hybridization. Such probes may comprise from about 50 nucleotides to about 100, 200, 500, or 1000 nucleotides or more than 1000 nucleotides. As further described herein, microarrays may contain oligonucleotide probes, consisting of about 10 to 50 nucleotides, preferably about 15 to 30 nucleotides and even more preferably 20-25 nucleotides. The probes are preferably single stranded. The probe will have sufficient complementarity to its target to provide for the desired level of sequence specific hybridization (see below).

[0179] Typically, the arrays used in the present invention will have a site density of greater than 100 different probes per cm2. Preferably, the arrays will have a site density of greater than 500/cm2, more preferably greater than about 1000/cm2, and most preferably, greater than about 10,000/cm2. Preferably, the arrays will have more than 100 different probes on a single substrate, more preferably greater than about 1000 different probes still more preferably, greater than about 10,000 different probes and most preferably, greater than 100,000 different probes on a single substrate.

[0180] Microarrays can be prepared by methods known in the art, as described below, or they can be custom made by companies, e.g., Affymetrix (Santa Clara, Calif.).

[0181] Generally, two types of microarrays can be used. These two types are referred to as “synthesis” and “delivery.” In the synthesis type, a microarray is prepared in a step-wise fashion by the in situ synthesis of nucleic acids from nucleotides. With each round of synthesis, nucleotides are added to growing chains until the desired length is achieved. In the delivery type of microarray, preprepared nucleic acids are deposited onto known locations using a variety of delivery technologies. Numerous articles describe the different microarray technologies, e.g., Shena et al., Tibtech 16: 301, 1998; Duggan et al., Nat. Genet. 21:10, 1999; Bowtell et al., Nat. Genet. 21: 25, 1999.

[0182] One novel synthesis technology is that developed by Affymetrix (Santa Clara, Calif.), which combines photolithography technology with DNA synthetic chemistry to enable high density oligonucleotide microarray manufacture. Such chips contain up to 400,000 groups of oligonucleotides in an area of about 1.6 cm2. Oligonucleotides are anchored at the 3′ end thereby maximizing the availability of single-stranded nucleic acid for hybridization. Generally such chips, referred to as “GeneChips®” contain several oligonucleotides of a particular gene, e.g., between 15-20, such as 16 oligonucleotides. Since Affymetrix (Santa Clara, Calif.) sells custom made microarrays, microarrays containing genes which are up- or down-regulated with cognitive impairments can be ordered for purchase from Affymetrix (Santa Clara, Calif.).

[0183] Microarrays can also be prepared by mechanical microspotting, e.g., those commercialized at Synteni (Fremont, Calif.). According to these methods, small quantities of nucleic acids are printed onto solid surfaces. Microspotted arrays prepared at Synteni contain as many as 10,000 groups of cDNA in an area of about 3.6 cm².

[0184] A third group of microarray technologies consist in the “drop-on-demand” delivery approaches, the most advanced of which are the ink-jetting technologies, which utilize piezoelectric and other forms of propulsion to transfer nucleic acids from miniature nozzles to solid surfaces. Inkjet technologies is developed at several centers including Incyte Pharmaceuticals (Palo Alto, Calif.) and Protogene (Palo Alto, Calif.). This technology results in a density of 10,000 spots per cm². See also, Hughes et al., Nat. Biotechn. 19:342, 2001.

[0185] Arrays preferably include control and reference nucleic acids. Control nucleic acids are nucleic acids which serve to indicate that the hybridization was effective. For example, all , Affymetrix (Santa Clara, Calif.) expression arrays contain sets of probes for several prokaryotic genes, e.g., bioB, bioC and bioD from biotin synthesis of E. coli and cre from P1 bacteriophage. Hybridization to these arrays is conducted in the presence of a mixture of these genes or portions thereof, such as the mix provided by Affymetrix (Santa Clara, Calif.) to that effect (Part Number 900299), to thereby confirm that the hybridization was effective. Control nucleic acids included with the target nucleic acids can also be MRNA synthesized from cDNA clones by in vitro transcription. Other control genes that may be included in arrays are polyA controls, such as dap, lys, phe, thr, and trp (which are included on Affymetrix GeneChips®)

[0186] Reference nucleic acids allow the normalization of results from one experiment to another, and to compare multiple experiments on a quantitative level. Exemplary reference nucleic acids include housekeeping genes of known expression levels, e.g., glyceraldehyde-3-phosphate dehydrogenase (GAPDH), hexokinase and actin.

[0187] Mismatch controls may also be provided for the probes to the target genes, for expression level controls or for normalization controls. Mismatch controls are oligonucleotide probes or other nucleic acid probes identical to their corresponding test or control probes except for the presence of one or more mismatched bases.

[0188] Arrays may also contain probes that hybridize to more than one allele of a gene. For example the array can contain one probe that recognizes allele 1 and another probe that recognizes allele 2 of a particular gene.

[0189] Microarrays can be prepared as follows. In one embodiment, an array of oligonucleotides is synthesized on a solid support. Exemplary solid supports include glass, plastics, polymers, metals, metalloids, ceramics, organics, etc. Using chip masking technologies and photoprotective chemistry it is possible to generate ordered arrays of nucleic acid probes. These arrays, which are known, e.g., as “DNA chips,” or as very large scale immobilized polymer arrays (“VLSIPSTM” arrays) can include millions of defined probe regions on a substrate having an area of about 1 cm² to several cm², thereby incorporating sets of from a few to millions of probes (see, e.g., U.S. Pat. No. 5,631,734).

[0190] The construction of solid phase nucleic acid arrays to detect target nucleic acids is well described in the literature. See, Fodor et al., Science, 251: 767-777, 1991; Sheldon et al., Clinical Chemistry 39(4): 718-719, 1993; Kozal et al., Nature Medicine 2(7): 753-759, 1996 and Hubbell U.S. Pat. No. 5,571,639; Pinkel et al. PCT/US95/16155 (WO 96/17958); U.S. Pat. Nos. 5,677,195; 5,624,711; 5,599,695; 5,451,683; 5,424,186; 5,412,087; 5,384,261; 5,252,743 and 5,143,854; PCT Patent Publication Nos. 92/10092 and 93/09668; and PCT WO 97/10365. In brief, a combinatorial strategy allows for the synthesis of arrays containing a large number of. probes using a minimal number of synthetic steps. For instance, it is possible to synthesize and attach all possible DNA 8 mer oligonucleotides (48, or 65,536 possible combinations) using only 32 chemical synthetic steps. In general, VLSIPSTM procedures provide a method of producing 4n different oligonucleotide probes on an array using only 4n synthetic steps (see, e.g., U.S. Pat. No. 5,631,734 5; 143,854 and PCT Patent Publication Nos. WO 90/15070; WO 95/11995 and WO 92/10092).

[0191] Light-directed combinatorial synthesis of oligonucleotide arrays on a glass surface can be performed with automated phosphoramidite chemistry and chip masking techniques similar to photoresist technologies in the computer chip industry. Typically, a glass surface is derivatized with a silane reagent containing a functional group, e.g., a hydroxyl or amine group blocked by a photolabile protecting group. Photolysis through a photolithogaphic mask is used selectively to expose functional groups which are then ready to react with incoming 5′-photoprotected nucleoside phosphoramidites. The phosphoramidites react only with those sites which are illuminated (and thus exposed by removal of the photolabile blocking group). Thus, the phosphoramidites only add to those areas selectively exposed from the preceding step. These steps are repeated until the desired array of sequences have been synthesized on the solid surface.

[0192] Algorithms for design of masks to reduce the number of synthesis cycles are described by Hubbel et al., U.S. Pat. No. 5,571,639 and U.S. Pat. No. 5,593,839. A computer system may be used to select nucleic acid probes on the substrate and design the layout of the array as described in U.S. Pat. No. 5,571,639.

[0193] Another method for synthesizing high density arrays is described in U.S. Pat. No. 6,083,697. This method utilizes a novel chemical amplification process using a catalyst system which is initiated by radiation to assist in the synthesis the polymer sequences. Such methods include the use of photosensitive compounds which act as catalysts to chemically alter the synthesis intermediates in a manner to promote formation of polymer sequences. Such photosensitive compounds include what are generally referred to as radiation-activated catalysts (RACs), and more specifically photo activated catalysts (PACs). The RACs can by themselves chemically alter the synthesis intermediate or they can activate an autocatalytic compound which chemically alters the synthesis intermediate in a manner to allow the synthesis intermediate to chemically combine with a later added synthesis intermediate or other compound.

[0194] Arrays can also be synthesized in a combinatorial fashion by delivering monomers to cells of a support by mechanically constrained flowpaths. See Winkler et al., EP 624,059. Arrays can also be synthesized by spotting monomers reagents on to a support using an ink jet printer. See id. and Pease et al., EP 728,520.

[0195] cDNA probes can be prepared according to methods known in the art and further described herein, e.g., reverse-transcription PCR (RT-PCR) of RNA using sequence specific primers. Oligonucleotide probes can be synthesized chemically. Sequences of the genes or cDNA from which probes are made can be obtained, e.g., from GenBank, other public databases or publications.

[0196] Nucleic acid probes can be natural nucleic acids, chemically modified nucleic acids, e.g., composed of nucleotide analogs, as long as they have activated hydroxyl groups compatible with the linking chemistry. The protective groups can, themselves, be photolabile. Alternatively, the protective groups can be labile under certain chemical conditions, e.g., acid. In this example, the surface of the solid support can contain a composition that generates acids upon exposure to light. Thus, exposure of a region of the substrate to light generates acids in that region that remove the protective groups in the exposed region. Also, the synthesis method can use 3′-protected 5′—0-phosphoramidite-activated deoxynucleoside. In this case, the oligonucleotide is synthesized in the 5′ to 3′ direction, which results in a free 5′ end.

[0197] Oligonucleotides of an array can be synthesized using a 96 well automated multiplex oligonucleotide synthesizer (A.M.O.S.) that is capable of making thousands of oligonucleotides (Lashkari et al., PNAS 93: 7912, 1995).

[0198] It will be appreciated that oligonucleotide design is influenced by the intended application. For example, it may be desirable to have similar melting temperatures for all of the probes. Accordingly, the length of the probes are adjusted so that the melting temperatures for all of the probes on the array are closely similar (it will be appreciated that different lengths for different probes may be needed to achieve a particular T[m] where different probes have different GC contents). Although melting temperature is a primary consideration in probe design, other factors are optionally used to further adjust probe construction, such as selecting against primer self-complementarity and the like.

[0199] Arrays, e.g., microarrrays, may conveniently be stored following fabrication or purchase for use at a later time. Under appropriate conditions, the subject arrays are capable of being stored for at least about 6 months and may be stored for up to one year or longer. Arrays are generally stored at temperatures between about −20° C. to room temperature, where the arrays are preferably sealed in a plastic container, e.g. bag, and shielded from light.

[0200] 6.3.3 Hybridizing the Target Nucleic Acid to the Microarray

[0201] The next step is to contact the target nucleic acids with the array under conditions sufficient for binding between the target nucleic acids and the probes of the array. In a preferred embodiment, the target nucleic acids will be contacted with the array under conditions sufficient for hybridization to occur between the target nucleic acids and probes on the microarray, where the hybridization conditions will be selected in order to provide for the desired level of hybridization specificity.

[0202] Contact of the array and target nucleic acids involves contacting the array with an aqueous medium comprising the target nucleic acids. Contact may be achieved in a variety of different ways depending on specific configuration of the array. For example, where the array simply comprises the pattern of size separated probes on the surface of a “plate-like” rigid substrate, contact may be accomplished by simply placing the array in a container comprising the target nucleic acid solution, such as a polyethylene bag, and the like. In other embodiments where the array is entrapped in a separation media bounded by two rigid plates, the opportunity exists to deliver the target nucleic acids via electrophoretic means. Alternatively, where the array is incorporated into a biochip device having fluid entry and exit ports, the target nucleic acid solution can be introduced into the chamber in which the pattern of target molecules is presented through the entry port, where fluid introduction could be performed manually or with an automated device. In multiwell embodiments, the target nucleic acid solution will be introduced in the reaction chamber comprising the array, either manually, e.g. with a pipette, or with an automated fluid handling device.

[0203] Contact of the target nucleic acid solution and the probes will be maintained for a sufficient period of time for binding between the target and the probe to occur. Although dependent on the nature of the probe and target, contact will generally be maintained for a period of time ranging from about 10 min to 24 hrs, usually from about 30 min to 12 hrs and more usually from about 1 hr to 6 hrs.

[0204] When using commercially available microarrays, adequate hybridization conditions are provided by the manufacturer. When using non-commercial microarrays, adequate hybridization conditions can be determined based on the following hybridization guidelines, as well as on the hybridization conditions described in the numerous published articles on the use of microarrays.

[0205] Nucleic acid hybridization and wash conditions are optimally chosen so that the probe “specifically binds” or “specifically hybridizes” to a specific array site, i.e., the probe hybridizes, duplexes or binds to a sequence array site with a complementary nucleic acid sequence but does not hybridize to a site with a non-complementary nucleic acid sequence. As used herein, one polynucleotide sequence is considered complementary to another when, if the shorter of the polynucleotides is less than or equal to 25 bases, there are no mismatches using standard base-pairing rules or, if the shorter of the polynucleotides is longer than 25 bases, there is no more than a 5% mismatch. Preferably, the polynucleotides are perfectly complementary (no mismatches). It can easily be demonstrated that specific hybridization conditions result in specific hybridization by carrying out a hybridization assay including negative controls.

[0206] Hybridization is carried out in conditions permitting essentially specific hybridization. The length of the probe and GC content will determine the Tm of the hybrid, and thus the hybridization conditions necessary for obtaining specific hybridization of the probe to the template nucleic acid. These factors are well known to a person of skill in the art, and can also be tested in assays. An extensive guide to the hybridization of nucleic acids is found in Tijssen (1993), “Laboratory Techniques in biochemistry and molecular biology-hybridization with nucleic acid probes.” Generally, stringent conditions are selected to be about 5° C. lower than the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Highly stringent conditions are selected to be equal to the Tm point for a particular probe. Sometimes the term “Td” is used to define the temperature at which at least half of the probe dissociates from a perfectly matched target nucleic acid. In any case, a variety of estimation techniques for estimating the Tm or Td are available, and generally described in Tijssen, supra. Typically, G-C base pairs in a duplex are estimated to contribute about 3° C. to the Tm, while A-T base pairs are estimated to contribute about 2° C., up to a theoretical maximum of about 80-100° C. However, more sophisticated models of Tm and Td are available and appropriate in which G-C stacking interactions, solvent effects, the desired assay temperature and the like are taken into account. For example, probes can be designed to have a dissociation temperature (Td) of approximately 60° C., using the formula: Td=(((((3 ×#GC)+(2×#AT))×37)−562)/#bp)−5; where #GC, #AT, and #bp are the number of guanine-cytosine base pairs, the number of adenine-thymine base pairs, and the number of total base pairs, respectively, involved in the annealing of the probe to the template DNA.

[0207] The stability difference between a perfectly matched duplex and a mismatched duplex, particularly if the mismatch is only a single base, can be quite small, corresponding to a difference in Tm between the two of as little as 0.5 degrees (See Tibanyenda, N. et al., Eur. J. Biochem. 139:19, 1984 and Ebel, S. et al., Biochem. 31:12083, 1992). More importantly, it is understood that as the length of the homology region increases, the effect of a single base mismatch on overall duplex stability decreases.

[0208] Theory and practice of nucleic acid hybridization is described, e.g., in S. Agrawal (ed.) Methods in Molecular Biology, volume 20; and Tijssen (1993) “Laboratory Techniques in biochemistry and molecular biology-hybridization with nucleic acid probes”, e.g., part I chapter 2 “Overview of principles of hybridization and the strategy of nucleic acid probe assays”, Elsevier, New York provide a basic guide to nucleic acid hybridization.

[0209] Certain microarrays are of “active” nature, i.e., they provide independent electronic control over all aspects of the hybridization reaction (or any other affinity reaction) occurring at each specific microlocation. These devices provide a new mechanism for affecting hybridization reactions which is called electronic stringency control (ESC). Such active devices can electronically produce “different stringency conditions” at each microlocation. Thus, all hybridizations can be carried out optimally in the same bulk solution. These arrays are described in Sosnowski et al., U.S. Pat. No. 6,051,380.

[0210] In a preferred embodiment, background signal is reduced by the use of a detergent (e.g, C-TAB) or a blocking reagent (e.g., sperm DNA, cot-1DNA, etc.) during the hybridization to reduce non-specific binding. In a particularly preferred (embodiment, the hybridization is performed in the presence of about 0.5 mg/ml DNA (e.g., herring sperm DNA). The use of blocking agents in hybridization is well known to those of skill in the art (see, e.g., Chapter 8 in Laboratory Techniques in Biochemistry and Molecular Biology, Vol. 24: Hybridization With Nucleic Acid Probes, P. Tijssen, ed. Elsevier, N.Y., (1993)).

[0211] The method may or may not further comprise a non-bound label removal step prior to the detection step, depending on the particular label employed on the target nucleic acid. For example, in certain assay formats (e.g., “homogenous assay formats”) a detectable signal is only generated upon specific binding of target to probe. As such, in these assay formats, the hybridization pattern may be detected without a non-bound label removal step. In other embodiments, the label employed will generate a signal whether or not the target is specifically bound to its probe. In such embodiments, the non-bound labeled target is removed from the support surface. One means of removing the non-bound labeled target is to perform the well known technique of washing, where a variety of wash solutions and protocols for their use in removing non-bound label are known to those of skill in the art and may be used. Alternatively, non-bound labeled target can be removed by electrophoretic means.

[0212] Where all of the target sequences are detected using the same label, different arrays will be employed for each physiological source (where different could include using the same array at different times). The above methods can be varied to provide for multiplex analysis, by employing different and distinguishable labels for the different target populations (representing each of the different physiological sources being assayed). According to this multiplex method, the same array is used at the same time for each of the different target populations.

[0213] In another embodiment, hybridization is monitored in real time using a charge-coupled device (CCD) imaging camera (Guschin et al., Anal. Biochem. 250:203, 1997). Synthesis of arrays on optical fibre bundles allows easy and sensitive reading (Healy et al., Anal. Biochem. 251:270, 1997). In another embodiment, real time hybridization detection is carried out on microarrays without washing using evanescent wave effect that excites only fluorophores that are bound to the surface (see, e.g., Stimpson et al., PNAS 92:6379, 1995).

[0214] 6.3.4 Detecting Hybridized Nucleic Acids and Analyzing the Results from the Microarray

[0215] The above steps result in the production of hybridization patterns of target nucleic acid on the array surface. These patterns may be visualized or detected in a variety of ways, with the particular manner of detection being chosen based on the particular label of the target nucleic acid. Representative detection means include scintillation counting, autoradiography, fluorescence measurement, calorimetric measurement, light emission measurement, light scattering, and the like.

[0216] One method of detection includes an array scanner that is commercially available from Affymetrix (Santa Clara, Calif.), e.g., the 417TM Arrayer, the 418TM Array Scanner, or the Agilent GeneArrayTM Scanner. This scanner is controlled from the system computer with a WindowsR interface and easy-to-use software tools. The output is a 16-bit.tif file that can be directly imported into or directly read by a variety of software applications. Preferred scanning devices are described in, e.g., U.S. Pat. Nos. 5,143,854 and 5,424,186.

[0217] When fluorescently labeled probes are used, the fluorescence emissions at each site of a transcript array can be detected by scanning confocal laser microscopy. In one embodiment, a separate scan, using the appropriate excitation line, is carried out for each of the two fluorophores used. Alternatively, a laser can be used that allows simultaneous specimen illumination at wavelengths specific to the two fluorophores and emissions from the two fluorophores can be analyzed simultaneously (see Shalon et al., Genome Research 6:639-645, 1996). In a preferred embodiment, the arrays are scanned with a laser fluorescent scanner with a computer controlled X-Y stage and a microscope objective. Sequential excitation of the two fluorophores can be achieved with a multi-line, mixed gas laser and the emitted light is split by wavelength and detected with two photomultiplier tubes. In one embodiment in which fluorescent target nucleic acids are used, the arrays may be scanned using lasers to excite fluorescently labeled targets that have hybridized to regions of probe arrays, which can then be imaged using charged coupled devices (“CCDs”) for a wide field scanning of the array. Fluorescence laser scanning devices are described, e.g., in Schena et al., supra. Alternatively, the fiber-optic bundle described by Ferguson et al., Nature Biotech. 14:1681-1684, 1996, may be used to monitor MRNA abundance levels.

[0218] Following the data gathering operation, the data will typically be reported to a data analysis operation. To facilitate the sample analysis operation, the data obtained by the reader from the device will typically be analyzed using a digital computer. Typically, the computer will be appropriately programmed for receipt and storage of the data from the device, as well as for analysis and reporting of the data gathered, e.g., subtraction of the background, deconvolution of multi-color images, flagging or removing artifacts, verifying that controls have performed properly, normalizing the signals, interpreting fluorescence data to determine the amount of hybridized target, normalization of background and single base mismatch hybridizations, and the like. In a preferred embodiment, a system comprises a search function that allows one to search for specific patterns, e.g., patterns relating to differential gene expression, e.g., between the expression profile of a sample from a patient with cognitive impairments and the expression profile of a counterpart normal subject. A system preferably allows one to search for patterns of gene expression between more than two samples.

[0219] A desirable system for analyzing data is a general and flexible system for the visualization, manipulation, and analysis of gene expression data. Such a system preferably includes a graphical user interface for browsing and navigating through the expression data, allowing a user to selectively view and highlight the genes of interest. The system also preferably includes sort and search functions and is preferably available for general users with PC, Mac or Unix workstations. Also preferably included in the system are clustering algorithms that are qualitatively more efficient than existing ones. The accuracy of such algorithms is preferably hierarchically adjustable so that the level of detail of clustering can be systematically refined as desired.

[0220] Various algorithms are available for analyzing the gene expression profile data, e.g., the type of comparisons to perform. In certain embodiments, it is desirable to group genes that are co-regulated. This allows the comparison of large numbers of profiles. A preferred embodiment for identifying such groups of genes involves clustering algorithms (for reviews of clustering algorithms, see, e.g., Fukunaga, 1990, Statistical Pattern Recognition, 2nd Ed., Academic Press, San Diego; Everitt, 1974, Cluster Analysis, London: Heinemann Educ. Books; Hartigan, 1975, Clustering Algorithms, New York: Wiley; Sneath and Sokal, 1973, Numerical Taxonomy, Freeman; Anderberg, 1973, Cluster Analysis for Applications, Academic Press: New York).

[0221] Clustering analysis is useful in helping to reduce complex patterns of thousands of time curves into a smaller set of representative clusters. Some systems allow the clustering and viewing of genes based on sequences. Other systems allow clustering based on other characteristics of the genes, e.g., their level of expression (see, e.g., U.S. Pat. No. 6,203,987). Other systems permit clustering of time curves (see, e.g. U.S. Pat. No. 6,263,287). Cluster analysis can be performed using the hclust routine (see, e.g., “hclust” routine from the software package S-Plus, MathSoft, Inc., Cambridge, Mass.).

[0222] In some specific embodiments, genes are grouped according to the degree of co-variation of their transcription, presumably co-regulation, as described in U.S. Pat. No. 6,203,987. Groups of genes that have co-varying transcripts are termed “genesets.” Cluster analysis or other statistical classification methods can be used to analyze the co-variation of transcription of genes in response to a variety of perturbations, e.g. caused by a disease or a drug. In one specific embodiment, clustering algorithms are applied to expression profiles to construct a “similarity tree” or “clustering tree” which relates genes by the amount of co-regulation exhibited. Genesets are defined on the branches of a clustering tree by cutting across the clustering tree at different levels in the branching hierarchy.

[0223] In some embodiments, a gene expression profile is converted to a projected gene expression profile. The projected gene expression profile is a collection of geneset expression values. The conversion is achieved, in some embodiments, by averaging the level of expression of the genes within each geneset. In some other embodiments, other linear projection processes may be used. The projection operation expresses the profile on a smaller and biologically more meaningful set of coordinates, reducing the effects of measurement errors by averaging them over each cellular constituent sets and aiding biological interpretation of the profile.

[0224] Values that can be compared include gross expression levels; averages of expression levels, e.g., from different experiments, different samples from the same subject or samples from different subjects; and ratios of expression levels.

[0225] 6.3.5 Data Analysis Methods for the Microarray

[0226] Comparison of the expression levels of one or more genes which are up-regulated in response to the inhibition of cognitive impairment with reference to expression levels in the absence of inhibition of cognitive impairment, e.g., expression levels characteristic of a disease or in normal subject, is preferably conducted using computer systems. In one embodiment, one or more expression levels are obtained from two samples and these two sets of expression levels are introduced into a computer system for comparison. In a preferred embodiment, one set of one or more expression levels is entered into a computer system for comparison with values that are already present in the computer system, or in computer-readable form that is then entered into the computer system.

[0227] In one embodiment, the invention provides a computer readable form of the gene expression profile data of the invention, or of values corresponding to the level of expression of at least one gene which is up-regulated in response to inhibition of cognitive impairment in a subject. The values can be MRNA expression levels obtained from experiments, e.g., microarray analysis. The values can also be MRNA levels normalized relative to a reference gene whose expression is constant in numerous cells under numerous conditions, e.g., GAPDH. In other embodiments, the values in the computer are ratios of, or differences between, normalized or non-normalized mRNA levels in different samples.

[0228] The computer readable medium may comprise values of at least 2, at least 3, at least 5, 10, 20, 50, 100, 200, 500 or more genes. In a preferred embodiment, the computer readable medium comprises at least one expression profile.

[0229] Gene expression data can be in the form of a table, such as an Excel table. The data can be alone, or it can be part of a larger database, e.g., comprising other expression profiles, e.g., publicly available database. The computer readable form can be in a computer. In another embodiment, the invention provides a computer displaying the gene expression profile data.

[0230] The invention provides methods in which the level of expression of a single gene can be compared in two or more cells or tissue samples. In some embodiments, the level of expression of a plurality of genes is compared. For example, the level of expression of at least 2, at least 3, at least 5, 10, 20, 50, 100, 200, 500 or more genes. In an embodiment, expression profiles are compared.

[0231] In one embodiment, the invention provides a method for determining the similarity between the level of expression of one or more genes which are up-regulated in response to inhibition of cognitive impairment. The method preferably comprises obtaining the level of expression of one or more genes which are up-regulated in response to inhibition of cognitive impairment in a first sample and entering these values into a computer comprising (i) a database including records comprising values corresponding to levels of expression of one or more genes in a control untreated sample, and (ii) processor instructions, e.g., a user interface, capable of receiving a selection of one or more values for comparison purposes with data that is stored in the computer. The computer may further comprise a means for converting the comparison data into a diagram or chart or other type of output.

[0232] In one embodiment, the invention provides a system that comprises a means for receiving gene expression data for one or a plurality of genes; a means for comparing the gene expression data from each of said one or plurality of genes to a common reference frame; and a means for presenting the results of the comparison. This system may further comprise a means for clustering the data.

[0233] In another embodiment, the invention provides a computer program for analyzing gene expression data comprising (i) a computer code that receives as input gene expression data for a plurality of genes and (ii) a computer code that compares said gene expression data from each of said plurality of genes to a common reference frame.

[0234] The invention also provides a machine-readable or computer-readable medium including program instructions for performing the following steps: (i) comparing a plurality of values corresponding to expression levels of one or more genes which are up—regulated in response to inhibition of NMD in a query cell with a database including records comprising reference expression of one or more reference cells and an annotation of the type of cell; and (ii) indicating to which cell the query cell is most similar based on similarities of expression levels.

[0235] The relative levels of expression, e.g., abundance of an mRNA, in two biological samples can be scored as a perturbation (relative abundance difference) or as not perturbed (i.e., the relative abundance is the same). For example, a perturbation can be a difference in expression levels between the two sources of RNA of at least a factor of about 25% (RNA from one source is 25% more abundant in one source than the other source), more usually about 50%, even more often by a factor of about 2 (twice as abundant), 3 (three times as abundant) or 5 (five times as abundant). Perturbations can be used by a computer for calculating and expressing comparisons.

[0236] Preferably, in addition to identifying a perturbation as positive or negative, it is advantageous to determine the magnitude of the perturbation. This can be carried out, as noted above, by calculating the ratio of the emission of the two fluorophores used for differential labeling, or by analogous methods that will be readily apparent to those of skill in the art.

[0237] The computer readable medium may further comprise a pointer to a descriptor of the level of expression or expression profile, e.g., from which source it was obtained, e.g., from which patient it was obtained. A descriptor can reflect the stage of disease, the therapy that the patient is undergoing or any other descriptions of the source of expression levels.

[0238] In operation, the means for receiving gene expression data, the means for comparing the gene expression data, the means for presenting, the means for normalizing, and the means for clustering within the context of the systems of the present invention can involve a programmed computer with the respective functionalities described herein, implemented in hardware or hardware and software; a logic circuit or other component of a programmed computer that performs the operations specifically identified herein, dictated by a computer program; or a computer memory encoded with executable instructions representing a computer program that can cause a computer to function in the particular fashion described herein.

[0239] Those skilled in the art will understand that the systems and methods of the present invention may be applied to a variety of systems, including IBM-compatible personal computers running MS-DOS or Microsoft Windows. Additionally the personal computer would have all of the hardware and software components normally associated with such a system such that the user would have capable memory, network connectivity, printing capability and programming capability with various computer languages. With the proper computer system the user could first load expression profile data into the computer system, U.S. Pat. No. 6,203,987. Geneset profile definitions are loaded into the memory from the storage media or from a remote computer, preferably from a dynamic geneset database system, through the network. Next the user causes execution of projection software which performs the steps of converting expression profile to projected expression profiles. The projected expression profiles are then displayed.

[0240] In yet another exemplary implementation, a user first leads a projected profile into the memory. The user then causes the loading of a reference profile into the memory. Next, the user causes the execution of comparison software which performs the steps of objectively comparing the profiles.

7. Screening for Compounds that Promote or Preserve Cognitive Function

[0241] Guided by the present disclosure, agents that modulate the expression of genes associated with cognitive function can be identified using in vitro and in vivo screening methods of the invention.

[0242] The invention provides methods for identification of agents useful for promoting or preserving cognitive function in mammals, e.g., rats and humans. In one aspect, the screening methods involve conducting assays to identify agents that modulate the expression of a gene encoding a glutamate transporter protein, e.g., an EAAT1, EAAT2, EAAT3, EAAT4 or EAAT5, or the activity of a glutamate transporter protein encoded by such a gene. For simplicity, reference below to “EAAT” is intended to refer to each of EAAT1, EAAT2, EAAT3, EAAT4 and EAAT5 individually, to the group comprising all of the genes/proteins, and to all subcombinations (e.g., EAAT1 and EAAT2). Alternatively, the screening methods involve conducting assays to identify agents that modulate the expression of aspartate aminotransferase.

[0243] A number of different screening protocols can be utilized to identify agents that modulate the level of expression of EAAT and/or aspartate aminotransferase in mammalian cells (e.g., rat cells, non-human primate cells or human cells). In general terms, the screening methods involve screening a plurality of agents (“test agents”) to identify an agent that changes the activity or level of EAAT by, for example without limitation, binding to an EAAT polypeptide, preventing an inhibitor from binding to an EAAT polypeptide, or increasing expression of an EAAT gene. Moreover, the screening methods involve screening a plurality of agents to identify an agent that changes the activity or level of aspartate aminotransferase by, for example without limitation, binding to an aspartate aminotransferase polypeptide, preventing an inhibitor from binding to an aspartate aminotransferasepolypeptide, or increasing expression of an aspartate aminotransferase gene.

[0244] “Test agents” include compounds of a variety of general types including, but not limited to, small organic molecules, known pharmaceuticals, polypeptides; carbohydrates such as oligosaccharides and polysaccharides; polynucleotides; lipids or phospholipids; fatty acids; steroids; or amino acid analogs. Test agents can be obtained from libraries, such as natural product libraries and combinatorial libraries. A number of different types of libraries are commercially available and methods for preparing libraries have been described, including for example, PCT publications WO 93/06121, WO 95/12608, WO 95/35503, WO 94/08051 and WO 95/30642. In addition, methods of automating assays are known that permit screening of several thousands of compounds in a short period.

[0245] Certain screening methods involve screening for a compound that increases the expression or activity of an EAAT and/or an aspartate aminotransferase protein in a cell. Such methods can involve conducting cell-based assays in which test compounds are contacted with one or more cells expressing an EAAT gene or protein and then detecting a change in EAAT expression (e.g., levels of EAAT RNA) or activity. Another method can involve conducting cell-based assays in which test compounds are contacted with one or more cells expressing an aspartate aminotransferase gene or protein and then detecting a change in aspartate aminotransferase expression (e.g., levels of aspartate aminotransferase RNA) or activity. Thus, in an embodiment the method comprises contacting a cell with a test agent and determining whether the level of expression of the gene is changed in the presence of the test agent, where a change (e.g., increase) in expression is an indication that the test agent is useful for promoting or preserving cognitive function. Cells can be contacted in vitro, in vivo or ex vivo. Typically expression is increased by at least about 10%, at least about 20%, at least about 50%, at least about 75%, or at least about 100% compared to expression in the absence of the test compound.

[0246] In an embodiment, the invention provides a method of screening for an agent to determine its usefulness for reduction of cognitive impairment by providing a cell expressing a glutamate transporter or aspartate aminotransferase gene expressed by mammalian neural cells, contacting the cell with a test agent; and determining whether the activity or level of expression of the glutamate transporter (e.g., one or more of EAAT1, EAAT2, EAAT3, EAAT4 and EAAT5) and/or aspartate aminotransferase (AT) is increased in the presence of the test agent, where such an increase is an indication that the test agent is useful in promoting or preserving cognitive function. Expression can be assessed by art known methods including detecting changes in the rate or abundance of EAAT or AT mRNA. Glutamate transporter protein activity can be assessed by art-known methods, including measuring the uptake of ³H-glutamate uptake into cells (Lin et al., Nature 410: 84-88, 2001). Aspartate aminotransferase protein activity may be assess by art-known methods, including in a coupled reaction with malate dehydrogenase in the presence of NADH (Karmen, J Clin Invest 34:131, 1955; Amador and Wacker, Clin Chem 8:343, 1962).

[0247] Usually this determination comprises comparing the activity or expression in the test cell compared to a similar cell or cells (i.e., control cells) that have not been contacted with the test compound. In a related embodiment, the test compound is administered to a multicellular organism (e.g., animal). The EAAT or aspartate aminotransferase component may be wholly endogenous to the cell or multicellular organism or may be a recombinant cell or transgenic organism comprising one or more recombinantly expressed EAAT and/or aspartate aminotransferase proteins. Expression of recombinant EAAT and/or aspartate aminotransferase proteins can be accomplished using published gene and protein sequences and routine methods (see, e.g., Ausubel et al., Current Protocols In Molecular Biology, Greene Publishing and Wiley-Interscience, New York (supplemented through 2002).

[0248] The assays can be carried out using any cell type that expresses an EAAT and/or aspartate aminotransferase gene including, in various embodiments, a cultured cell (e.g., a cell in a primary culture or an established cell line) and a cell in vivo. Exemplary cells include neurons, glia cells, mixed neuronal cultures or cells in which EEAT and/or aspartate aminotransferase gene expression is induced by recombinant expression. Such cells (e.g., primary cultures) can be obtained from fetal hippocampus. Many other suitable cells or cell lines will be known to the practitioner.

[0249] The effect of an agent on EAAT and/or aspartate aminotransferase gene expression in a cell or in vitro system can be compared to a baseline value, which is typically the level of expression by the cell or in vitro system in the absence of the test agent. Expression levels can also be determined for cells that do not express EAAT and/or aspartate aminotransferase as a negative control. Such cells generally are otherwise substantially genetically the same as the test cells.

[0250] Other cell-based assays are reporter assays conducted with cells that do not necessarily express an EAAT and/or aspartate aminotransferase. Certain of these assays are conducted with a heterologous nucleic acid construct that includes an EAAT or aspartate aminotransferase gene promoter that is operably linked to a reporter gene that encodes a detectable product. EAAT gene promoters are located, in most cases, within a region about 300 to 1000 bp upstream (or 5′) of the transcription start sites and are described in, for exampe, Su et al., PNAS 100:1955-1960, 2003. Aspartate aminotransferase gene promoters are located, in most cases, within a region about 300 to 1000 bp upstream (or 5′) of the transcription start sites and are described in, for exampe Obaru et al., J Mol Biol. 200:13-22, 1988. Certain EAAT and aspartate aminotransferase gene promoters are described in GenBank (http://www.ncbi.nlm.nih.gov/) and the scientific literature. A number of different reporter genes can be utilized. Exemplary reporters include green fluorescent protein, J-glucuronidase, chloramphenicol acetyl transferase, luciferase, J-galactosidase, alkaline phosphatase, and the like. In these assays, cells harboring the reporter construct are contacted with a test compound. A test compound that either activates the promoter by binding to it or triggers a cascade that produces a molecule that activates the promoter causes expression of the detectable reporter. A variety of different types of cells can be utilized in the reporter assays (e.g., eukaryotic cells such as yeast, COS, CHO, HepG2, and HeLa cell lines).

[0251] Identification of agents that increase activity of the EAAT or aspartate aminotransferase protein can also include screening for compounds capable of binding to an EAAT or aspartate aminotransferase protein, as at least some of the compounds so identified are likely EAAT or aspartate aminotransferase modulators. Lead compounds identified during these screens can serve as the basis for the synthesis of more active analogs. Thus, in one aspect, the invention provides a method of screening for an agent to determine its usefulness in reduction of cognitive impairment by (a) contacting a polypeptide encoded by an EAAT or aspartate aminotransferase gene, or a cell expressing such a polypeptide with a test compound, and (b) determining whether the polypeptide binds to the test compound. Such binding is an indication that the test agent is useful in reduction of cognitive impairment. The binding assays usually involve contacting an EAAT or aspartate aminotransferase polypeptide with one or more test compounds and allowing sufficient time for the protein and test compounds to form a binding complex. Determining the ability of the test compound to directly bind to an EAAT or aspartate aminotransferase polypeptide can be accomplished, for example, by coupling the compound to a radioisotope or enzymatic label such that binding of the compound to the EAAT or aspartate aminotransferase polypeptide can be determined by detecting the labeled EAAT or aspartate aminotransferase polypeptide in a complex. Any binding complexes formed can be detected using any of a number of established analytical techniques. Protein binding assays include, but are not limited to, methods that measure co-precipitation, co-migration on non-denaturing SDS-polyacrylamide gels, and co-migration on Western blots (see, e.g., E.C. Hulme, 1992, “Receptor-Ligand Interactions” in A Practical Approach/The Practical Approach Series (Series Eds D. Rickwood and B D Hames) IRL Press at Oxford University Press). The EAAT (or aspartate aminotransferase) polypeptide utilized in such assays can be purified or recombinant. As noted above, recombinant expression and purification of EAAT (or aspartate aminotransferase) proteins can be accomplished using routine methods.

[0252] The EAAT or aspartate aminotransferase proteins can, in vivo, interact with one or more cellular and extracellular molecules (such as, without limitation, peptides, proteins, hormones, cofactors and nucleic acids) herein referred to as “binding partners.” Methods are known for identify its natural in vivo binding partners of EAATs, e.g., two and three-hybrid assays (see, e.g., U.S. Pat. No. 5,283,317; Zervos et al, 1993, Cell 72:223-232; Madura et al, 1993, J. Biol. Chem. 268:12046-12054; Bartel et al, 1993, Biotechniques 14:920-924; Iwabuchi et al, 1993 Oncogene 8:1693-1696; Brent WO94/10300). Such EAAT or aspartate aminotransferase protein binding partners may be involved in the propagation of signals by the EAAT o r aspartate aminotransferase protein or downstream elements of an EAAT or aspartate aminotransferase protein-mediated signaling pathway, or, alternatively, may be found to be inhibitors of the EAAT or aspartate aminotransferase protein. Art-known assays can be devised through the use of the invention to identify compounds that modulate (e.g., affect either positively or negatively) interactions between an EAAT or aspartate aminotransferase protein and its binding partners. Typically, the assay for compounds that interfere with the interaction between the EAAT or aspartate aminotransferase protein and its binding partner involves preparing a reaction mixture containing the EAAT or aspartate aminotransferase protein and its binding partner under conditions and for a time sufficient to allow the two products to interact and bind, thus forming a complex. In order to test an agent for inhibitory activity, the reaction mixture is prepared in the presence and absence of the test compound. Also within the scope of the present invention are methods for direct detection of interactions between the EAAT or aspartate aminotransferase protein and its natural binding partner and/or a test compound in a homogeneous or heterogeneous assay system without further sample manipulation. For example, the technique of fluorescence energy transfer may be utilized (see, e.g., Lakowicz et al, U.S. Pat. No. 5,631,169; Stavrianopoulos et al, U.S. Pat. No. 4,868,103).

[0253] In one aspect, agents identified by assay(s) described above can be administered to experimental animals to measure their cognition promoting and preserving activities (see, e.g., Example, infra).

[0254] In one aspect, the invention features a method of screening compounds for utility in promoting cognitive function of a mammal by administering a test compound to a mammal, determining the level of expression of one or more EAAT or AT gene(s) in neural tissue of the mammal following administration of said test compound, comparing the level of expression of the gene(s) to a reference level of expression in neural tissue of a mammal to which the test compound was not administered and determining whether the level of expression of the gene differs from the corresponding reference level, where a difference indicates that the test compound is a candidate therapeutic agent for promoting cognitive function. The method may also include a further step of comparing the level of expression of a gene to reference level of expression in neural tissue of a mammal to whom ceftriaxone or valproic acid was administered. In one embodiment, the mammal is a rat, such as an aged rat.

8. Therapeutic Methods and Compositions

[0255] An additional embodiment of the invention relates to the administration of a pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable carrier, for any of the therapeutic effects discussed above. Such pharmaceutical compositions may contain a molecule, such as a small molecule, that beneficially modulates expression of a gene associated with preservation or promotion of cognitive function during aging.

[0256] The inventors have unexpectly discovered that reduction of L-glutamate levels in the extracellular space surrounding neurons and glia cells in the brain, including the synaptic cleft and extrasynaptic space, is correlated with preservation or promotion of cognitive function during ageing. In one aspect, the invention provides a method for preserving or promoting cognitive function (e.g., to treat cognitive impairment associated with aging) in a mammal by increasing expression of glutamate transport proteins by brain cells. In a related aspect, the invention provides a method for reducing cognitive impairment associated with aging in a mammal by increasing the activity of glutamate transport proteins expressed in brain cells.

[0257] In one aspect, expression or activity of a glutamate transporter protein is increased by administration of a small molecule to the mammal. Exemplary small molecules include cephalosporin and analogs or derivatives thereof, valproic acid and analogs or derivatives thereof, MS-153 and analogs and derivatives thereof; and agonists of metabotropic glutamate receptors (mGluR's; see Aronica et al. supra). The small molecule may increase the expression or activity of the transporter protein directly (e.g., by interacting with a promoter of a transporter protein-encoding gene, or by interacting with the protein product itself) or indirectly (e.g., increasing expression or activity of a protein that stimulates expression or activity of a transporter protein or decreasing expression or activity of a protein that inhibits expression or activity of a transporter protein); and PACAP (“pituitary adenyl cyclase activator polypeptide”).

[0258] Other exemplary compounds that can increase expression or activity of glutamate transporter proteins include lidocaine (Do et al., Anesth Analg. 2002 95:1263-8 “The effects of lidocaine on the activity of glutamate transporter EAAT3: the role of protein kinase C and phosphatidylinositol 3-kinase”) and kinase inhibitors (e.g., Conradt, J Neurochem. 199768:1244-51 “Inhibition of the high-affinity brain glutamate transporter GLAST-1 via direct phosphorylation”).

[0259] For illustration and not limitation exemplary therapeutic compounds are described in more detail in the following section.

8.1 Exemplary Herapeutic Compositions

[0260] Examples of small molecules, that beneficially modulates expression of a gene associated with promoting or preserving cognitive function during aging (e.g., an EAAT gene) include coumpound related to cephalosporin of the formula I:

[0261] wherein, individually for each occurrence:

[0262] L is O or S;

[0263] R is H, C₁₋₁₀ alkyl, C₁₋₁₀ alkoxy, aryl, aralkyl, —OCH₂CO₂H;

[0264] R¹ is —(CH₂)_(n)—C(O)X

[0265] wherein

[0266] X is OH, NR₂, SH, O-alkali metal, or —OC(CH₃)OC(O)OCH(CH₃)₂; and

[0267] n is an integer from 0 to 6 inclusive;

[0268] R² is H, C₁₋₁₀ alkyl, C₂₋₈ alkenyl, or —(CH₂)_(a)—W—R³

[0269] wherein

[0270] R³ is H, C₁₋₁₀ alkyl, —C(O)C₁₋₁₀ alkyl, —C(O)NR₂, aryl, aralkyl, or A;

[0271] W is O, S, or NR⁴; and

[0272] a is an integer from 1 to 6 inclusive;

[0273] wherein

[0274] R⁴ is H, C₁₋₁₀ alkyl, —C(O)C₁₋₁₀ alkyl, aryl, aralkyl, or R³ and R⁴ taken together may form an unsubstituted or substituted heteroalkyl or heteroaryl ring;

[0275] the

line indicates either a single or double bond;

[0276] R⁵ is R¹, H, SO₃H, aryl, C₁₋₁₀ alkyl, aralkyl; or R⁵ is selected from the group consisting of ═CHCH₂CO₂H and ═NR when the

line is a double bond;

[0277] m is 0 or 1; and

[0278] A is aryl or heteroaryl of formula Ia:

[0279] wherein, independently for each occurrence:

[0280] J is O, S, NR⁶, or CR⁶; and

[0281] y is 1 or 2;

[0282] wherein R⁶ is an electron pair, H, C₁₋₁₀ alkyl, C₁₋₁₀ alkoxy, aryl, or —NR₂;

[0283] or A is heterocycloalkyl of formula Ib or Ic:

[0284] wherein, independently for each occurrence:

[0285] J is O, S, or NR; and

[0286] X is O or H₂.

[0287] Particular compounds of the class described by formula I includes “ceftriaxone” which refers to the broad spectrum cephalosporin antibotic, (6R,7R)-7-[2-(2-Amino-4-thiazolyl)glyoxylamido]-8-oxo-3-[[(1,2,5,6-tetrahydro-2-methyl-5,6-dioxo-as-triazin-3-yl)thio]methyl]-5-thia-1-azabicyclo[4.2.0]oct-2-ene-2-carboxyalic acid,7²—(Z)—(O-methyloxime), disodium salt, sesquaterhydrate. Ceftriaxone is available commercially from Roche under the trade name Rocephin™. Methods for making the compounds of formula may be found in, for example, U.S. Pat. Nos.: 5,574,155; 5,739,346; 5,856,502; 5,869,649; 5,945,414; 5,945,532 and 6,090,801. Derivatives of ceftriaxone include any of the third-generation cephalosporins that are capable of killing aerobic gram-negative rods. Examples of third-generation cephalosporins are cefsulodin, cefotaxime, ceftizoxime, ceftriaxone, cefoperazone, moxalactam, and ceftazidime.

[0288] Other examples of small molecules that beneficially modulate expression of a gene associated with promoting or preserving cognitive function during aging (e.g., an EAAT gene) include compounds related to valproic acid of the formula II:

[0289] wherein, independently for each occurrence:

[0290] X is —OH, C₁₋₁₀ alkoxy, —O-alkali metal, —N(R¹)₂, —SH, or —S—C₁₋₁₀ alkyl;

[0291] R is a straight chain or branched C₁₋₃₀ alkyl; and

[0292] R¹ is H, C₁₋₁₀ alky, C₂₋₁₀ alkenyl, C₂₋₁₀ alkynyl, aryl, or aralkyl;

[0293] provided that R may be unsubstituted or substituted by one or more —OH, C₁₋₁₀ alkoxy, —N(R¹)₂, —SH, —S—C₁₋₁₀ alkyl, or aryl.

[0294] Particular compounds of the class described be formula II include “valproic acid” which refers to 2-propylpentanoate the anticonvulsant drug that may be related to increased brain concentrations of γ-aminobutyric acid (GABA). Other names and descriptions of valproic acid are also envisioned herein, such as Depakote™, Valproate™, Valrelease™ and sodium valproate. Methods for making the compounds of formula may be found in, for example, U.S. Pat. Nos.: 4,558,070; 4,595,695; 4,654,370; 4,895,873; 4,913,906; 5,017,613; 5,019,398; 5,049,586, 5,162,573; 5,440,023; 5,856,569; 6,131,106 and 6,610,326.

[0295] Other examples of small molecules that beneficially modulate expression of a gene associated with promoting or preserving cognitive function during aging (e.g., an EAAT gene) include compounds related to (R)-(−)-5-methyl-1-nicotinoyl-2-pyrazoline of the formula III:

[0296] wherein, independently for each occurrence:

[0297] R is H, C₁-C₁₀ alkyl, C₂-C₁₀ alkenyl, C₂-C₁₀ alkynyl, aryl, or aralkyl;

[0298] R¹ is H, C₁-C₁₀ alkyl, C₂-C₁₀ alkenyl, C₂-C₁₀ alkynyl, aryl, or aralkyl;

[0299] R² is a heterocyclic or heteroaryl ring comprising from 1-4 heteroatoms selected from the following: N, O, or S;

[0300] L is O, S, or NR; and

[0301] X is CR₂, O, or S.

[0302] A particular compound of the formula III includes (R)-(−)-5-methyl-1-nicotinoyl-2-pyrazoline (MS-153).

[0303] Also included in the methods of the present invention are pharmaceutically acceptable addition salts and complexes of the compounds of formula I, II and III. In cases wherein the compounds may have one or more chiral centers, unless specified, the present invention comprises each unique racemic compound, as well as each unique nonracemic compound. In cases in which the compounds have unsaturated carbon-carbon double bonds, both the cis (Z) and trans (E) isomers are within the scope of this invention. In cases wherein inhibitors may exist in tautomeric forms, such as keto-enol tautomers, such as

[0304] each tautomeric form is contemplated as being included within this invention, whether existing in equilibrium or locked in one form by appropriate substitution with R′. The meaning of any substituent at any one occurrence is independent of its meaning, or any other substituent's meaning, at any other occurrence.

[0305] Also included in the methods of the present invention are prodrugs of the compounds of formula I, II and III.

[0306] The pharmaceutical compositions utilized in this invention may be administered by any number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means.

[0307] The compositions may be administered alone or in combination with at least one other agent, such as a stabilizing compound, which may be administered in any sterile, biocompatible pharmaceutical carrier including, but not limited to, saline, buffered saline, dextrose, and water. The compositions may be administered to a patient alone, or in combination with other agents, drugs, or hormones.

[0308] Certain compounds contained in compositions of the present invention may exist in particular geometric or stereoisomeric forms. In addition, polymers of the present invention may also be optically active. The present invention contemplates all such compounds, including cis- and trans-isomers, R- and S-enantiomers, diastereomers, (D)-isomers, (L)-isomers, the racemic mixtures thereof, and other mixtures thereof, as falling within the scope of the invention. Additional asymmetric carbon atoms may be present in a substituent such as an alkyl group. All such isomers, as well as mixtures thereof, are intended to be included in this invention.

[0309] If, for instance, a particular enantiomer of compound of the present invention is desired, it may be prepared by asymmetric synthesis, or by derivation with a chiral auxiliary, where the resulting diastereomeric mixture is separated and the auxiliary group cleaved to provide the pure desired enantiomers. Alternatively, where the molecule contains a basic functional group, such as amino, or an acidic functional group, such as carboxyl, diastereomeric salts are formed with an appropriate optically-active acid or base, followed by resolution of the diastereomers thus formed by fractional crystallization or chromatographic means well known in the art, and subsequent recovery of the pure enantiomers.

[0310] For purposes of this invention, the chemical elements are identified in accordance with the Periodic Table of the Elements, CAS version, Handbook of Chemistry and Physics, 67th Ed., 1986-87, inside cover. Also for purposes of this invention, the term “hydrocarbon” is contemplated to include all permissible compounds having at least one hydrogen and one carbon atom. In a broad aspect, the permissible hydrocarbons include acyclic and cyclic, branched and unbranched, carbocyclic and heterocyclic, aromatic and nonaromatic organic compounds that may be substituted or unsubstituted.

8.2 Therapeutic Compositions

[0311] Contemplated equivalents of the compositions described herein include compositions which otherwise correspond thereto, and which have the same general properties thereof, wherein one or more simple variations of substituents or components are made which do not adversely affect the characteristics of the compositions of interest.

[0312] In addition to the active ingredients, e.g., one or more of the therapeutic agents, pharmaceutical compositions of the present invention may contain suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active compounds into preparations which can be used pharmaceutically. Further details on techniques for formulation and administration may be found in the latest edition of Remington's Pharmaceutical Sciences (Maack Publishing Co., Easton, Pa.).

[0313] Pharmaceutical compositions for oral administration can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for oral administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the patient.

[0314] Pharmaceutical preparations for oral use can be obtained through combining active compounds with solid excipient and processing the resultant mixture of granules (optionally, after grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable excipients include carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol, and sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, including arabic and tragacanth; and proteins, such as gelatin and collagen. If desired, disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, and alginic acid or a salt thereof, such as sodium alginate.

[0315] Dragee cores may be used in conjunction with suitable coatings, such as concentrated sugar solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for product identification or to characterize the quantity of active compound, i.e., dosage.

[0316] Pharmaceutical preparations which can be used orally include push-fit capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid, or liquid polyethylene glycol with or without stabilizers.

[0317] Pharmaceutical formulations suitable for parenteral administration may be formulated in aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution, Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain substances which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. Additionally, suspensions of the active compounds may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils, such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate, triglycerides, or liposomes. Non-lipid polycationic amino polymers may also be used for delivery. Optionally, the suspension may also contain suitable stabilizers or agents to increase the solubility of the compounds and allow for the preparation of highly concentrated solutions.

[0318] For topical or nasal administration, penetrants appropriate to the particular barrier to be permeated are used in the formulation. Such penetrants are generally known in the art.

[0319] The pharmaceutical compositions of the present invention may be manufactured in a manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes.

[0320] The pharmaceutical composition may be provided as a salt and can be formed with many acids, including but not limited to, hydrochloric, sulfiric, acetic, lactic, tartaric, malic, and succinic acids. Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding free base forms. In other cases, the preferred preparation may be a lyophilized powder which may contain any or all of the following: 1 mM to 50 mM histidine, 0.1% to 2% sucrose, and 2% to 7% mannitol, at a pH range of 4.5 to 5.5, that is combined with buffer prior to use.

[0321] After pharmaceutical compositions have been prepared, they can be placed in an appropriate container and labeled for treatment of an indicated condition. For administration of ceftriaxone, for example, such labeling would include amount, frequency, and method of administration.

[0322] Pharmaceutical compositions suitable for use in the invention include compositions wherein the active ingredients are contained in an effective amount to achieve the intended purpose. The determination of an effective dose is well within the capability of those skilled in the art.

[0323] For any compound, the therapeutically effective dose can be estimated initially either in cell culture assays, e.g., according to the method of Aronica et al., supra, or in animal models such as mice, rats, rabbits, dogs, or pigs. An animal model may also be used to determine the appropriate concentration range and route of administration. A particularly preferred animal model uses behaviorally characterized rats as described herein. Such information can then be used to determine useful doses and routes for administration in humans.

[0324] A therapeutically effective dose refers to that amount of active ingredient, for example ceftriaxone, ceftriaxone analogs, ceftriaxone derivatives, valproic acid, valproic acid analogs, valproic acid derivative, MS-153, MS-153 analogs or MS-153 derivatives, which ameliorate the symptoms or condition. Therapeutic efficacy and toxicity may be determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by calculating the ED₅₀ (the dose therapeutically effective in 50% of the population) or LD₅₀ (the dose lethal to 50% of the population) statistics. The dose ratio of therapeutic effects to toxic effects is the therapeutic index, which can be expressed as the LD₅₀/ED₅₀ ratio. Pharmaceutical compositions which exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are used to formulate a range of dosage for human use. The dosage contained in such compositions is preferably within a range of circulating concentrations that includes the ED₅₀ with little or no toxicity. The dosage varies within this range depending upon the dosage form employed, the sensitivity of the patient, and the route of administration.

[0325] The exact dosage will be determined by the practitioner, in light of factors related to the subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active moiety or to maintain the desired effect. Factors which may be taken into account include the degree of cogntive impairment, the general health of the subject, the age, weight, and gender of the subject, time and frequency of administration, drug combination(s), reaction sensitivities, and response to therapy. Long-acting pharmaceutical compositions may be administered every 3 to 4 days, every week, or biweekly depending on the half-life and clearance rate of the particular formulation.

[0326] Normal dosage amounts may vary from about 0.1 μg to 100,000 μg, up to a total dose of about 1 gram, depending upon the route of administration. Guidance as to particular dosages and methods of delivery is provided in the literature and generally available to practitioners in the art.

[0327] To be exert a therapeutic effect on central nervous system targets, such as EAAT 1, 2, 3, 4 or 5, the compounds used in the methods of the present invention should readily penetrate the blood-brain barrier when peripherally administered. Compounds which cannot penetrate the blood-brain barrier, however, can still be effectively administered directly into the central nervous system, e.g., by an intraventricular route.

[0328] The term “pharmaceutically-acceptable salts” is art-recognized and refers to the relatively non-toxic, inorganic and organic acid addition salts of compounds, including, for example, those contained in compositions of the present invention.

[0329] The term “pharmaceutically acceptable carrier” is art-recognized and refers to a pharmaceutically-acceptable material, composition or vehicle, such as a liquid or solid filler, diluent, excipient, solvent or encapsulating material, involved in carrying or transporting any subject composition or component thereof from one organ, or portion of the body, to another organ, or portion of the body. Each carrier must be “acceptable” in the sense of being compatible with the subject composition and its components and not injurious to the patient. Some examples of materials which may serve as pharmaceutically acceptable carriers include: (1) sugars, such as lactose, glucose and sucrose; (2) starches, such as corn starch and potato starch; (3) cellulose, and its derivatives, such as sodium carboxymethyl cellulose, ethyl cellulose and cellulose acetate; (4) powdered tragacanth; (5) malt; (6) gelatin; (7) talc; (8) excipients, such as cocoa butter and suppository waxes; (9) oils, such as peanut oil, cottonseed oil, safflower oil, sesame oil, olive oil, corn oil and soybean oil; (10) glycols, such as propylene glycol; (11) polyols, such as glycerin, sorbitol, mannitol and polyethylene glycol; (12) esters, such as ethyl oleate and ethyl laurate; (13) agar; (14) buffering agents, such as magnesium hydroxide and aluminum hydroxide; (15) alginic acid; (16) pyrogen-free water; (17) isotonic saline; (18) Ringer's solution; (19) ethyl alcohol; (20) phosphate buffer solutions; and (21) other non-toxic compatible substances employed in pharmaceutical formulations.

[0330] The terms “systemic administration,” “administered systemically,” “peripheral administration” and “administered peripherally” are art-recognized and refer to the administration of a subject composition, therapeutic or other material other than directly into the central nervous system, such that it enters the patient's system and, thus, is subject to metabolism and other like processes, for example, subcutaneous administration.

[0331] The terms “parenteral administration” and “administered parenterally” are art-recognized and refer to modes of administration other than enteral and topical administration, usually by injection, and includes, without limitation, intravenous, intramuscular, intraarterial, intrathecal, intracapsular, intraorbital, intracardiac, intradermal, intraperitoneal, transtracheal, subcutaneous, subcuticular, intra-articular, subcapsular, subarachnoid, intraspinal, and intrastemal injection and infuslion.

9. EXEMPLIFICATION

[0332] The invention, having been generally described, may be more readily understood by reference to the following examples, which are included merely for purposes of illustration of certain aspects and embodiments of the present invention, and are not intended to limit the invention in any way.

9.1 Characterizing the Young, Aged-impaired (AI) and Aged-unimpaired (AU) Animals

[0333] 9.1.1 Morris Water Maze (MWM) and Radial Arm Maze (RAM) Subjects

[0334] We performed behavioral tests on 9 young (4-6 mo) and 18 aged (25-27 months) pathogen-free male Long-Evans rats with the MWM and used the same animals for microarray analysis. An additional 10 aged rats were tested in the MWM, followed by training and testing in the RAM to assess test-retest reliability for individual differences in cognitive function across the two tasks.

[0335] 9.1.2 Morris Water Maze Apparatus

[0336] The MWM apparatus consists of a large, circular pool (diameter 1.83 m; height, 0.58 m) filled with water (27° C.) that has been made opaque through the addition of non-toxic pigment or some other substance. In the typical “hidden platform” version of the task, rats are trained to find a camouflaged white escape platform (height, 34.5 cm) that is positioned in the center of one quadrant of the maze just 1.0 cm below the water surface. This platform could be retracted to the bottom of the tank or raised to its normal position from outside the maze during behavioral testing. The location of this platform remained constant from trial to trial. Because there were no local cues that marked the position of the platform, the rat's ability to locate it efficiently from any starting position at the perimeter of the pool depended on using information surrounding the maze. The maze was surrounded by black curtains with white patterns affixed to provide a configuration of spatial cues. A second platform (height 37.5 cm), with its surface painted black was elevated 2 cm above the water surface during cue training, the version of the task used to control for factors unrelated to cognition. The behavior of a rat in the pool was recorded by a camera suspended 2.5 m above the center of the pool, connected to a video tracking system (HVS Image Advanced Tracker VP200) and a PC computer running HVS software developed by Richard Baker of HVS Image, Hampton, UK.

[0337] 9.1.3 Morris Water Maze Procedure

[0338] We optimized the MWM protocol for sensitivity to the effects of aging on cognition and for measures of reliable individual differences within the aged population of out-bred Long-Evans rats (Gallagher M, Burwell R, Burchinal M. Behav. Neurosci. 107:618-626; 1993).

[0339] Rats received three trials per day for 8 consecutive days, using a 60 sec intertrial interval. On each training trial, the rat was released in the maze from one of four equally spaced starting positions around the perimeter of the pool. The starting position varied from trial to trial, thus preventing the use of a response strategy (e.g. always turning left from the start location to locate the escape platform). If a rat did not locate the escape platform within 90 sec on any trial, the experimenter guided the rat to the platform, where it remained for 30 sec. Every sixth trial consisted of a probe trial to assess the development of spatial bias in the maze. During these trials, the rat swam with the platform retracted to the bottom of the pool for 30 sec, at which time the platform was raised to its normal position for completion of an escape trial. At the completion of the protocol using the hidden platform, rats were assessed for cue learning using the visible platform. The location of this platform varied from trial to trial in a single session of 6 training trials.

[0340] We used the proximity of the animal's position with respect to the goal for analysis of training trial and probe trial performance. The proximity measure was obtained by sampling the position of the animal in the maze (10×/sec) to provide a record of distance from the escape platform in 1 sec averages. For both probe trials and training trials, a correction procedure was implemented so that trial performance was relatively unbiased by differences in distance to the goal from the various start locations at the perimeter of the pool. In making this correction the average swimming speed was calculated for each trial (pathlength/latency). Then the amount of time required to swim to the goal at that speed from the start location used on the trial was removed from the record prior to computing trial performance, i.e. cumulative distance on training trials and average distance from the goal on probe trials. Thus, scores obtained using the proximity measure are designed to reflect search error, representing deviations from an optimal search, i.e. direct path to the goal and search in the immediate vicinity of that location during probe trials. ps 9.1.4 Morris Water Maze Analysis

[0341] Computer records of video-tracking were compiled to provide data on each rat's performance in the maze. Measures on training trials and probe trials were analyzed by Analysis of Variance.

[0342] 9.1.5 Morris Water Maze Data Results

[0343] The performance during training with the hidden, camouflaged platform differed between the groups of young and aged rats [F(1,23)=12.69, p<0.002]. No difference between the groups occurred for the cue training trials with a visible platform. Latencies to escape during cue training averaged 9.36 seconds for young and 10.60 seconds for the aged rats.

[0344] The average proximity measure on interpolated probe trials was used to calculate a spatial learning index for each individual subject as described in detail in Gallagher M, Burwell R, Burchinal M. Behav. Neurosci. 107:618-626; 1993. When a rat rapidly learned to search for the platform close to its position, it's spatial learning index is low. Overall, aged rats differed from young [F(1,23)=15.18, p<0.001]. Aged rats were classified as either unimpaired or impaired relative to the learning index profile of the young study population. Aged rats that fall within the normative range of young rats (index scores <241) were designated aged unimpaired (FIG. 1). The remaining aged subjects that have index scores outside the range of young performance were designated aged impaired.

[0345] 9.1.6 Radial Arm Maze Apparatus

[0346] Each arm (7×75 cm) of the elevated eight arm radial maze projected from each facet of an octagonal center platform (30 cm diameter, 51.5 cm height). Clear side walls on the arms were 10 cm high and were angled at 65° to form a trough. A food well (4 cm diameter, 2 cm deep) was located at the distal end of each arm. Blocks constructed of Plexiglas (30 cm H×12 cm W) could be positioned to block entry to any arm. Numerous extra maze cues were provided in the room surrounding the apparatus and lighting was provided by overhead fixtures.

[0347] 9.1.7 Radial Arm Maze Procedures

[0348] Rats were first habituated to the maze for an 8 min session on four consecutive days. In each of these sessions food rewards were scattered on the RAM, initially on the center platform and arms and then progressively confined to the arms. After this habituation phase, a standard training protocol was used in which a food pellet was located at the end of each arm. Rats received one trial each day for 18 days; each daily trial terminated when all eight food pellets had been obtained or when either 16 choices were made or 15 min had elapsed. An error consisted of returning to an arm (all four paws on the arm) from which food had already been obtained. After completion of this phase, the memory demand of the task was increased by imposing a delay during the trial. At the beginning of each trial three arms were blocked. The identity and configuration of the blocked arms was varied across trials. Rats were allowed to obtain food on the five arms to which access was permitted at the beginning of the trial. The rat was then removed from the maze for 60 s, during which time the barriers on the maze were removed, thus allowing access to all eight arms. Rats were then placed back onto the center platform and allowed to obtain the remaining food rewards.

[0349] 9.1.8 Radial Arm Maze Analysis

[0350] A memory error occurred during test trials using a 60 second delay when a rat returned to one of the five arm that was already visited prior to the delay. Each rat's performance was averaged across four consecutive test trials. Parametric statistics (unpaired t-tests) were used to compare performance between young and aged groups. Correlational analysis (Pearson's r) was used to examine the relationship between performance of aged rats (N=10) in the Morris Water maze (learning index scores) and radial-arm maze (memory errors).

[0351] 9.1.9 Radial Arm Maze Results

[0352] The performance of young adult rats in the delay version of the RAM varies as a function of the delay interval, ranging from 60 seconds to eight hours (Chappell et al. Neuropharmacology 37: 481-488, 1998). Aged rats previously characterized in the MWM, committed more memory errors after a 60 second delay relative to young rats (p <0.025). On average young rats committed 0.17 errors, whereas aged rats committed an average of 1.52 errors. The ten aged rats, however, exhibited a wide range of performance on the RAM. A significant relationship was found between the initial MWM characterization and memory performance in the RAM (r value=0.82, data shown in FIG. 2).

9.2 Gene Expression Analysis of the Young, Aged-impaired (AI) and Aged-unimpaired (AU) Animals

[0353] 9.2.1 Preparation of RNA from Behaviorally Characterized Animals

[0354] Twenty-seven behaviorally characterized rats (data shown in FIG. 1) were killed with an overdose of sodium pentobarbital (100 mg/kg). The hippocampus was dissected bilaterally and frozen (−80° C.). One hippocampus from each animal was weighed and homogenized in the appropriate volume of phenol-guanadine isothiocyanate (Trizol reagent; 1 ml per 100 mg of tissue with a minimum volume of 1 ml). Each sample was extracted with chloroform (200 μl per ml of Trizol) and precipitated with isopropanol. RNA pellets were air dried and resuspended in DEPC treated water. All samples were stored at −80° C. A portion of the RNA was further purified using Qiagen's RNeasy mini RNA extraction kit according to manufacturer's instructions and subsequently stored at −80° C. Samples were quantified by absorbance at 260 nm and purity determined by ratio of absorbance at 260 nm and 280 nm. Sample integrity and concentration was confirmed by agarose gel electrophoresis. Photographs of agarose gels were scanned, the pixels were inverted and quantified using NIH-image. Concentrations were then adjusted if needed.

[0355] For analysis on gene microarrays, samples from three rats of the same phenotype, either Y, AI or AU, were pooled to yield independent microarray analysis for a sample size of three GeneChips®/phenotype. With respect to behavioral characterization by spatial learning index in the MWM, tissue was pooled as set forth in Table I TABLE I Young Aged Unimpaired Aged Impaired Y1: 143.1, 171.4, AU1: 181.5, 202.9, 229.5 AI1: 268.3, 283.2, 381 194.6 Y2: 139.5, 169.4, AU2: 183.9, 218.4, 229.3 AI2: 285.5, 303.4, 337.7 186.6 Y3: 169, 173.9, AU3: 193.3, 228.2, 230.6 AI3: 282.2, 307.2, 329.3 196.5

[0356] 9.2.2 Reverse Transcription and Hybridization to Microarray

[0357] Labeled cRNA probes for hybridization were prepared using the Affymetrix Enzo Bioarray high yield RNA transcript labeling system. RNAs were reverse transcribed into cDNA and converted to biotin labeled cRNA. Internal standards provided with each labeling system were added to test RNA prior to reverse transcription. cRNAs were then tested on control chips to ensure that reverse transcription and labeling were optimal before performing hybridization onto experimental GeneChips®. cRNAs were applied to U34A Affymetrix GeneChip® arrays. These arrays included specific sequences for 7000 expressed rat genes and 1000 EST clusters, and included all genes represented on a recently developed, smaller, neuroscience gene microarray. A GeneChip Fluidics Station automated introduction of the labeled cRNAs on to the gene arrays and hybridization as conducted in a GeneChip hybridization oven. A GeneArray scanner was used to detect and quantify hybridization signals for each oligomer set based on confocal laser scanning.

[0358] 9.2.3 Data Analysis of Microarray

[0359] A Genechip Analysis Suite and Affymetrix MAS 4.0 and the more recently developed MAS 5.0 algorithms were used in our analysis of data. Both algorithms had a default threshold based on known negative genes and average signal intensities per chip. Normalization and scaling methods based on predefined oligomer sets defined by Affymetrix were applied to permit comparisons between Genechips. Both algorithms generated values for levels of expression per set of perfectly matched (PM) oligomers corresponding to each expressed gene, relative to a set of mismatched (MM) oligomers designed with variant bases calculated to inhibit hybridization of a perfectly matched cRNA. The empirical MAS 4.0 algorithm used raw data to generate an average difference call which provides a measure of the hybridization signal intensity for PM oligomers relative to control MM oligomers for each gene. An absolute call of present (P), marginal (M), or absent (A) was based on the number of PM oligomers which were positive relative to MM oligomers. The statistical MAS 5.0 algorithms relied on essentially the same type of comparison but applied statistical methods to generate a p value to enhance the probability that a call of present reflected a level of expression higher than background. In addition, statistical criteria were applied to the signal algorithm in calculating hybridization signal intensity which were intended to minimize the impact of outliers within PM and MM oligomers in each probe set. The basic difference in data representing expression levels yielded by the two algorithms was that MAS 5.0 was designed to eliminate negative expression levels which would lead to an overestimate of the number of genes with altered expression levels between two comparison groups. Overall the signal obtained per oligomer set with MAS 5.0 was less than that with MAS 4.0.

[0360] The power of our model lies, in part, in the ability to compare across the three groups Y, AU and AI to identify those genes which either change between young and aged hippocampus and thus generally relate to the aging process and those which discriminate AU and AI rats. The genes identified through this process relate specifically to aging-cognitive impairment or preservation of cognitive function.

[0361] In generating the data disclosed herein we conducted a series of analytic steps to identify 3 sets of genes informative for the model of age and cognitive impairment. Set 1 comprised genes of interest that differ from young as a function of age alone. Set 2 comprised of genes of interest that differ in the impaired aged rats relative to both young and aged unimpaired. Set 3 (referred to as Aged Unimpaired genes) consisted of genes that differ in the aged unimpaired relative to both young and aged impaired and may, therefore, related to age-induced preservation of cognitive function. Each set was generated from the full microarray dataset following the MAS 5.0 analysis and then used similar algorithms for an effect size analysis.

[0362] Herein we described the steps for generating Set 3 (Aged Unimpaired genes), in which glutamate transporters were included. In the first step the values for all probe sets on the chips representing gene expression in the young rats (N=3) and chips for the aged impaired (N=3) were examined for a detection criterion. All probe pair sets that did not meet the detection criterion using the absolute call from the MAS 5.0 analysis were eliminated from further consideration. A simple effects analysis was then conducted to determine that the values on the chips for the two groups (young and aged impaired) did not differ by an effect size of greater than 1.0. All probe sets that met both detection criterion and the criterion for pooling in the comparison group (young and aged impaired not differing by an effect size of 1.0 or greater). A simple effects analysis was then conducted on those pooled probe sets for young and aged impaired along with the corresponding probe sets from the aged unimpaired chips. The simple effects analysis yielded 384 probe sets, representing “genes of interest” for age-induced preservation of cognitive function, with an effect size of 1.25 or greater. Power analysis for inferential statistical tests of significance indicated that the sample sizes of the microarray experiment (3 aged unimpaired chips and 6 chips in the pooled comparison group) would detect a difference at p<0.05 with 80% power for genes with an effect size of 2.5 or greater. In follow-up analysis with sample sizes of 9 aged unimpaired, and 9 subjects in each of the comparison groups (young and aged impaired) that are now underway, statistical power at 80% could be expected for effect sizes of 1.25 and greater.

[0363] 9.2.4 Results from the Microarray

[0364] Different transcripts for glutamate transporters were detected in the human and rodent nervous system (FIG. 3). Two of these transcripts were not expressed in the hippocampal formation (EAAT 4 and EAAT5). The three remaining transcripts had differing patterns of cellular localization in neurons and glia, and were all expressed in the hippocampus. In the effect size analysis for Aged Unimpaired genes (Set 3) in the microarray dataset GLT1 had an effect size of 5.87 (the largest value in the effect size analysis for aged unimpaired genes) and GLAST, a second glutamate transporter, had an effect size of 1.93 (FIG. 4). In both cases the glutamate transporter MRNA was increased in the unimpaired aged rats relative to the pooled comparison of young and aged impaired. As expected from the power analysis for the microarray experimental design, GLT1 MRNA was significantly increased (p<0.0002) relative to the comparison young and aged impaired. In a similar analysis, GLAST was also significantly increased (p=0.0484; FIG. 4).

[0365] An additional indication that glutamate is regulated differently in aged rats with preserved cognitive function was provided by a difference in mRNA for aspartate aminotransferase in Aged Unimpaired relative to Young and Aged Impaired. The probe set for aspartate aminotransferase, a major mechanism for inactivation of extracellular glutamate, showed a significant increase in the AU rats relative to the comparison Y+AI, which did not differ from each other. The effect size for that mRNA was 2.58.

[0366] A probe set on the microarray was an identified gene for pituitary adenyl cyclase activator polypeptide (PACAP; GenBank Accession No.: AI227715; EST224410), that regulates glutamate transport and metabolism (Figiel and Engele, J. Neurosci. 15: 3596-3605, 2000). PACAP MRNA was significantly elevated in aged unimpaired relative to comparison young and aged impaired rats (Effect size 3.29, t=4.13, p<0.005; FIG. 4).

[0367] Because PACAP receptors are of a type that exhibit desensitization, a strong increase in β-arrestin 2 MRNA may participate in effects on glutamate tranporters. Beta-arrestin 2 has a dual role in receptor endocytosis and in mediating signaling cascades through the same receptors (Wei et al., PNAS, 100;10782-7, 2003; Ahn et al., PNAS, 100;1740-4, 2003; e.g. GenBank Accession No.: XM_(—)345084). The probe sets for β-arrestin 2 in the aged unimpaired rats showed significant elevation relative to comparison young and aged impaired (Effect size=2.78, t=2.59, p<0.05).

[0368] 9.2.5 In situ Hybridization Histochemistry Analysis

[0369] A follow-up analysis was conducted using an independent set of animals (N of 3 or 4 per group) using sections of hippocampus processed for in situ hybridization histochemistry.

[0370] 9.2.6 In situ Hybridization Probe Preparation

[0371] We used reverse-transcriptase polymerase chain reaction (RT-PCR) to generate specific probes corresponding to glutamate transporters identified as up-regulated in hippocampus AU rats by the microarray analyses. A sample RNA prepared from hippocampus of aged rats (pool of animals with learning index scores within (AU) or outside (AI) the scores of young animals was reverse transcribed using oligodT and reverse transcriptase. This cDNA was then used as a template with sense and antisense oligomer probes (Table II) in a first round PCR reactions to amplify double stranded cDNAs corresponding to the particular glutamate transporters described below. Standard PCR conditions were employed (initial denaturation at 94° C. for 4 min, initial annealing at 72° C. for 40 sec, 35 cycles of 94° C., 30 sec denaturation, 55° C., 30 sec annealing and 72° C. 30 sec extension, a final extension at 72° C. for 4 min and cooling to 4° C.). Aliquots of PCR products were electrophoresed on ethidium bromide-agarose gels to verify that probes of appropriate size were generated. A second round PCR reaction was used with each of the amplified cDNAs and extended oligomers (Table II) comprising a sense oligomer corresponding to an SP6 promoter sequence plus the original sense oligomer used to derive each glutamate transporter PCR product, and an antisense oligomer comprising the T7 promoter sequence and the original antisense oligomer for each glutamate transporter. This generated specific glutamate transporter cDNAs with SP6 and T7 promoters on sense and antisense strands, respectively. PCR products were collected by ethanol precipitation and the sequences verified by nucleotide sequencing using an automated DNA sequencer and SP6 primers.

[0372] Sequence-verified PCR products corresponding to each glutamate transporter with SP6 and T7 promoter sites were used in DNA directed in vitro transcription of sense and antisense RNA from each second round glutamate transporter PCR product. In vitro transcription reactions were performed with each PCR product, SP6 or T7 polymerase (each enzyme used in separate reactions for each probe), unlabelled nucleotide triphosphates and high specific activity ³⁵S labeled uridine triphosphate. These reactions yielded three radiolabeled antisense glutamate transporter probes which specifically recognize each of the different glutamate transporter mRNAs in tissue sections by base pairing and corresponding sense probes which comprised the same sequence as glutamate transporter mRNAs, cannot base pair with the mRNAs and served as a negative control. Antisense probes were incubated with frozen sections of hippocampus from Y, AU an AI rats representing sections throughout the entire hippocampus. Sense probes were hybridized with sections from fewer selected regions of hippocampus but sufficient to verify uniformly negative hybridization signals and validate specificity of hybridization signals obtained with antisense probes. After hybridization and washing, sections were apposed to X-ray film and standard quantitative autoradiography (Bizon 2001) was used to quantify signals.

[0373] Oligomers used to derive templates for preparation of glutamate transporter sense and antisense RNA probes for in situ hybridization histochemistry were described in Table II. TABLE II GLT-1 EAAC1 GLAST PCR product 405 bp 409 bp 441 bp Sense 5′-GAGCATTGGTG 5′-GTCTGAGAACA 5′-GGTAGAAGCCT CAGCCAGTA-3′ AGACAAAGG-3′ GCTTTAAAC-3′ Antisense 5′-CCAAGGTTCTTCC 5′-TGAGAGCTGTCA 5′-GGCATGAATGAG TCAACAC-3′ GGAGAGC-3′ GAGGCCGAC-3′ Extended SP6 5′-tatttaggtgacactatagGAGCA 5′-tatttaggtgacactatagGTCT 5′-tatttaggtgacactatag GGTA Sense TTGGTGCAGCCAGTA-3′ GAGAACAAGACAAAG GAAGCCTGCTTTAAAC-3′ G-3′ Extended T7 5′taatacgactcactataggggCCAA 5′taatacgactcactataggggTGA 5′-taatacgactcactataggggGG Sense GGTTCTTCCTCAAC-3′ GAGCTGTCAGGAGAG CATGAATGAGGAGGCCGA C-3′ C-3′

[0374] 9.2.7 Preparation of Tissue for in situ Hybridization Histochemistry

[0375] Eleven behaviorally characterized rats (n=4 young and n=7 old) were killed with an overdose of sodium pentobarbital (100 mg/kg) and intracardial perfusion with 4% paraformaldehyde in 0.1 M phosphate buffer, pH 7.4 (PPB). All perfusions were performed between 0700 h and 1000 h. Following fixation, brains were removed from the cranium and both right and left hippocampi were immediately dissected from the surrounding tissue. Dissected hippocampi were postfixed in PPB for 24 h, cryoprotected in PPB containing 20% sucrose for 24 h, frozen on powdered dry ice in an “extended” orientation, and stored at −80° C. until further processing.

[0376] The hippocampus for each animal was sectioned (25 mm) coronal to the longitudinal axis using a freezing microtome, and collected into cold PPB. Free-floating sections of tissue were washed in 0.75% glycine in 0.1 M phosphate buffer, pH, 7.2 (PB) and 0.1M PB alone to remove excess fixative. Sections were treated for 30 min at 37° C. with proteinase K (1 mg/ml in 0.1 M Tris buffer containing 0.05% SDS), acetylated in 0.25% acetic anhydride in 0.1 M triethanolamine, pH 8.0, and rinsed twice in 2 saline sodium citrate buffer (SSC; 1×SSC=0.15 M sodium chloride and 0.015 M sodium citrate, pH 7.0). Tissue was then hybridized for 42-44 h at 60° C. in solution containing 50% formamide, 1×Denhardt's solution, 10% dextran sulfate, 4×SSC, 0.25 mg/ml yeast tRNA, 0.3 mg/ml herring sperm DNA, 100 mm dithiothreitol (DTT), and the appropriate ³⁵S-labeled cRNA at a final concentration of 1×107 CPM/ml. Following hybridization, sections were washed at 30 min intervals, twice in 4×SSC, once in 50% formamide/2×SSC at 60° C. and then treated with ribonuclease A (20 mg/ml in 10 mM Tris saline buffer containing 1 mM ethylene-diaminetetracetic acid) for 30 min at 37° C. Tissue sections were washed further in descending concentrations of SSC buffer containing 100 mM DTT to a final wash of 0.1×SSC and mounted onto gelatin-coated slides for film autoradiography. Air-dried sections of the hippocampal sections were exposed with ¹⁴C-standards (American Radiolabeled Chemicals, Inc., St. Louis, Mo.) to β-max hyperfilm (Amersham Pharmacia Biotech, Piscataway, N.J.) for 24-72 hours. The exposure time for rostral coronal sections was 110-130 hours. Films were developed using GBX developer and fixed with Kodak rapid fixer.

[0377] In situ hybridization labeling was quantified by densitometric analysis of film autoradiograms using the MCID imaging system (Imaging Research, St. Catherine's, Ontario, Canada). Film densities were linearized and calibrated relative to the ¹⁴C-labeled standards that were exposed to each sheet of film with tissue sections. Values for hybridization signal intensity (mCi/gram protein) were calculated for each rat as the average of multiple measures from 6-8 tissue sections. Mean hybridization signal intensities for each rat within a group were averaged to obtain a group mean ±standard error. Statistical comparisons were made using a one-way ANOVA. For all statistical tests, a 95% confidence level (p <0.05) was considered significant.

[0378] 9.2.8 Results of in situ Hybridization Histochemistry

[0379] The abundance of GLT1 was significantly greater in aged unimpaired relative to young and aged impaired (p<0.03), whereas a trend for greater GLAST MRNA was also found (p=0.089; FIG. 5). The learning index scores representing cognitive status in the set of rats used for in situ hybridization were similar to the animals in the microarray study; aged unimpaired (182, 223, 240) young (177, 219, 200, 227) and aged impaired (290, 285, 297, 298).

[0380] In the microarray dataset and the in situ dataset, values for a third glutamate transporter mRNA (EAAC1) were numerically higher in the unimpaired aged group than in the comparison young and impaired aged rats, but those differences were not statistically significant. The mRNA for one probe set in the microarray (AF038571) showed that EAAC1 was significantly elevated in the aged unimpaired relative to aged impaired. The MRNA in a second probe set for EAAC1 showed a similar trend (aged unimpaired greater than aged impaired, p=0.09).

9.3 Preserved Cognitive Function Using Ceftriaxone

[0381] 9.3.1 Effect of Ceftriaxone Treatment on GLT1 mRNA in Young Rats

[0382] Young rats were given daily injection of ceftriaxone at 200 mg/kg intramuscularly (N=2) or vehicle (N=3) for one week. After sacrifice, the hippocampus was dissected and frozen.

[0383] 9.3.2 Real Time Reverse Transcription-PCR Method for Quantifying GLT-1 mRNA

[0384] 9.3.2.1 Preparation of RNA for Real Time RT-PCR

[0385] RNA was extracted from the right hippocampus of ceftriaxone or saline treated animals (3 young animals per condition) using Trizol reagent and purified over Qiagen RNeasy columns in a manner identical to the procedure used for microarray experiments. After the concentration and integrity was determined (in the same manner as for microarray RNA) samples were diluted to a concentration of 50 ng per microliter. 100 ng of this sample was reverse transcribed in a total volume of 10 μl using Applied Biosysten's Taqman reverse transcription reagents with the following conditions: 1× reverse transcription buffer, 0.5 mM of each dNTP, 5.5 mM MgCl2, 1.25 Units/μL of Multiscribe reverse transcriptase, 0.4 Units/μL RNase inhibitor, and 2.5 μM oligo dT primer. Samples were incubated at room temperature for 10 minutes followed by 45 minutes at 48° C. and then 5 minutes at 95° C. Samples were diluted 1:20 and 1:100 for use in a real time PCR reaction. A Standard RNA was generated by combining the extracted and purified hippocampal RNA from 2 separate animals and reverse transcribing this RNA in conditions identical to that for the experimental samples above except that 500 ng of RNA is used in a 50 μl reaction. Standard cDNA was diluted to 1:20, 1:100, 1:500 and 1:2500 by serial dilution. In addition 200 ng of standard RNA was placed in a 20 μl reaction with the same conditions as above except that the reverse transcriptase is omitted. This sample was referred to as the no RT sample and is included to indicate background activity of the RNA sample itself. This sample was serially diluted 1:20 and 1:100.

[0386] 9.3.2.2 Real Time PCR

[0387] PCR reactions were performed in triplicate on each cDNA sample at two different concentrations for GLT1a, GLT1 and GAPDH using the 1:20 and 1:100 dilution of the cDNA. The final concentration of cDNA in the PCR reaction was 100 pg/μl and 20 pg/μl, and was based on extrapolation from the concentration of input RNA. The standard was used at all four dilutions to generate extrapolated final concentrations of 100 pg/μl, 20 pg/μl, 4 pg/μl and 0.8 pg/μl. A PCR reaction mix using Invitrogen's Platinum quantitative PCR kit was assembled for all samples using the same primer and probe set which was then divided into separate tubes for each cDNA at each concentration. The cDNA was then added to the mixture which was then distributed to each of three real time PCR tubes. All reactions were run in a RotorGene 3000 (Corbett Research) with the following conditions: 2 minutes at 50° C., 5 minutes at 95° C. and then 45 cycles of 25 seconds at 95° C. and 60 seconds at 60° C. Data was acquired on the Joe channel for GAPDH and FAM/SYBR channel for GLT1 and GLT1a probe/primer sets. Spike suppression and dynamic tube normalization were used for all runs. GLT1 or GLT1a and GAPDH real time PCR was occasionally performed during separate runs.

[0388] For GAPDH the optimal reaction conditions were 0.6 Units Platinum Taq DNA polymerase, 20 mM Tris-HCl (pH 8.4), 50 mM KCl, 200 μM dGTP, 200 μM dATP, 200 μM dCTP, 400 μM dUTP, 0.4 Units UDG, 4.5 mM MgCl2, 200 nM Forward primer, 200 nM Reverse primer, 50 nM probe labeled with Vic on the 5′ end and TAMRA quencher on the 3′ end. Primers and probe were obtained from Applied Biosystems(Cat # 4308313); the sequences were unknown, but the amplicon length was 177 bps.

[0389] For GLT1 the optimal reaction conditions were 0.6 Units Platinum Taq DNA polymerase, 20 mM Tris-HCI (pH 8.4), 50 mM KCI, 200 μM dGTP, 200 μM dATP, 200 μM dCTP, 400 μM dUTP, 0.4 Units UDG, 6.0 mM MgCl2, 50 nM Forward primer, 200 nM Reverse primer, 50 nM probe. The amplicon length was 65 bps. The forward primer was 5′ GAG CTG GAC ACC ATT GAC TC 3′ and reverse primer was 5′ GAC TGC GTC TTG GTC ATT TC 3′. The probe was 5′ CAA CAC CGA ATG CAC GAA GAC ATC 3′ labeled with a 5′ 6-fam and 3′ tamra.

[0390] For GLT1a the optimal reaction conditions were 0.6 Units Platinum Taq DNA polymerase, 20 mM Tris-HCl (pH 8.4), 50 mM KCl, 200 μM dGTP, 200 μM DATP, 200 μM dCTP, 400 μM dUTP, 0.4 Units UDG, 6.0 mM MgCl2, 200 nM Forward primer, 200 nM Reverse primer, 100 nM probe. Amplicon length was 76 bps. The forward primer was 5′ ATG AGT GCA AGG TAA CTC TGG 3′ and the reverse primer was 5′ TCA CGT TTC CAA GGT TCT TC 3′. The probe was 5′ CCA ATG GAA AGT CAG CTG ACT GCA 3′ labeled with 5′ 6-fam and 3′ BHQ1 (Black hole quencher 1).

[0391] 9.3.2.3 Data Analysis of Real Time PCR

[0392] Comparative amounts of GAPDH and GLT1 or GLT1a were determined using the DDCt method (Livak and Schmittgen, Methods 25:402-408 2001). The threshold for all reactions was set at 0.05 fluorescent units and the Ct for each sample determined by the Rotorgene software. Average Ct values for each GLT1 cDNA at each concentration were determined and subtracted from the average values for GAPDH. Samples were excluded if the GAPDH value is more than 15% above or below the mean GAPDH value. GLT1 mRNA increased an average of 1.34 fold in the drug treated animals and GLT1a mRNA increased an average of 1.27 in the drug treated animals. These values were consistent with the full change with GLT1 observed in AU rats in the microarray, which was an increase of 1.31.

9.4 Effect of Ceftriaxone Treatment on the Performance of Aged Rats in the RAM

[0393] After characterization for cognitive status in the MWM, impaired aged rats were assigned to one of two treatment conditions (control vehicle or ceftriaxone at 200 mg/kg) that were equated with respect to their MWM learning index scores. Initially rats in both treatment conditions were trained on the RAM task (habituation, no-delay version, and then delay of 60 seconds). Then daily injection of vehicle or drug occurred in the morning (8:00-9:00 AM). Rats received daily testing on the radial-arm maze with the delay extended to 3 hr. Critical tests at the 3 hour delay occurred on Days 8-10 (after 7 days of injection). Memory errors for aged rats receiving vehicle were significantly elevated relative to a young group tested concurrently. Compared to aged rats receiving vehicle, aged rats receiving drug administration had fewer errors (FIG. 6). This initial pilot assessment using 3 drug-treated rats and 4 aged rats treated with vehicle is currently being replicated with additional behaviorally characterized rats to increase sample sizes to 7-8 rats per group.

[0394] 9.4.1 Effect of Ceftriaxone Treatment on GLT1 mRNA in Aged Rats

[0395] At completion of RAM testing aged rats under vehicle and drug treatment are sacrificed. Dissected hippocampus is frozen and stored (−80° C.) for analysis of GLT1 mRNA, which is conducted at the completion of the replication of behavioral testing with additional aged rats. If drug treatment increases GLT1 mRNA, as expected based on the effect of drug treatment at the same dose in young rats, a microarray analysis is conducted to compare gene expression profiles in aged impaired rats treated with ceftriaxone relative to control aged impaired subject.

[0396] 9.4.2 Effect of Ceftriaxone Treatment on Test-retest in the MWM

[0397] Test-retest reliability is obtained when aged rats are characterized in the standard MWM protocol and then tested after weeks or months in a new spatial environment using the MWM. The preclinical efficacy of a compound to improve function in aged rats with cognitive impairment can be assessed in a re-test using the MWM. Effectiveness of a treatment across the two tasks (RAM and MWM) strengthens evidence for a drug action on cognitive function independent of other components that differ across the two tasks (e.g. motivational basis for performance).

9.5 Improved and Preserved Cognitive Function with other Compounds

[0398] This work includes administration of the test compound or vehicle to young rats in order to establish a treatment regimen that improves or preserves cognitive function. Test compounds include valproic acid, compounds that modulate metabotropic glutamate receptor (mGluR) activity and compounds that modulate pituitary adenyl cyclase activator polypeptide (PACAP) expression. Aged impaired rats characterized on the MWM are assigned to drug or vehicle treatment and tested in the RAM. Analysis of glutamate transporter expression and other gene profiling, such as mGluR expression and activity, PACAP expression or β-arrestin 2 expression are then conducted at the completion of behavioral testing

9.6 Equivalents

[0399] While specific embodiments of the subject invention have been discussed, the above specification is illustrative and not restrictive. Many variations of the invention will become apparent to those skilled in the art upon review of this specification. The appendant claims are not intended to claim all such embodiments and variations, and the full scope of the invention should be determined by reference to the claims, along with their full scope of equivalents, and the specification, along with such variations.

[0400] All publications and patents mentioned herein are hereby incorporated by reference in their entireties as if each individual publication or patent was specifically and individually indicated to be incorporated by reference. In case of conflict, the present application, including any definitions herein, will control.

[0401] The contents of each of the references cited in the present application, including publications, patents, and patent applications, are herein incorporated by reference in their entirety.

1 49 1 1566 DNA Homo sapiens 1 gaaatctctt gattcctagt ctctcgatat ggcacctccg tcagtctttg ccgaggttcc 60 gcaggcccag cctgtcctgg tcttcaagct cactgccgac ttcagggagg atccggaccc 120 ccgcaaggtc aacctgggag tgggagcata tcgcacggat gactgccatc cctgggtttt 180 gccagtagtg aagaaagtgg agcagaagat tgctaatgac aatagcctaa atcacgagta 240 tctgccaatc ctgggcctgg ctgagttccg gagctgtgct tctcgtcttg cccttgggga 300 tgacagccca gcactcaagg agaagcgggt aggaggtgtg caatctttgg ggggaacagg 360 tgcacttcga attggagctg atttcttagc gcgttggtac aatggaacaa acaacaagaa 420 cacacctgtc tatgtgtcct caccaacctg ggagaatcac aatgctgtgt tttccgctgc 480 tggttttaaa gacattcggt cctatcgcta ctgggatgca gagaagagag gattggacct 540 ccagggcttc ctgaatgatc tggagaatgc tcctgagttc tccattgttg tcctccacgc 600 ctgtgcacac aacccaactg ggattgaccc aactccggag cagtggaagc agattgcttc 660 tgtcatgaag caccggtttc tgttcccctt ctttgactca gcctatcagg gcttcgcatc 720 tggaaacctg gagagagatg cctgggccat tcgctatttt gtgtctgaag gcttcgagtt 780 cttctgtgcc cagtccttct ccaagaactt cgggctctac aatgagagag tcgggaatct 840 gactgtggtt ggaaaagaac ctgagagcat cctgcaagtc ctttcccaga tggagaagat 900 cgtgcggatt acttggtcca atccccccgc ccagggagca cgaattgtgg ccagcaccct 960 ctctaaccct gagctctttg aggaatggac aggtaatgtg aagacaatgg ctgaccggat 1020 tctgaccatg agatctgaac tcagggcacg actagaagcc ctcaaaaccc ctgggacctg 1080 gaaccacatc actgatcaaa ttggcatgtt cagcttcact gggttgaacc ccaagcaggt 1140 tgagtatctg gtcaatgaaa agcacatcta cctgctgcca agtggtcgaa tcaacgtgag 1200 tggcttaacc accaaaaatc tagattacgt ggccacctcc atccatgaag cagtcaccaa 1260 aatccagtga agaaacacca cccgtccagt accaccaaag tagttctctg tcatgtgtgt 1320 tccctgcctg cacaaaccta catgtacata ccatggatta gagacacttg caggactgaa 1380 aggctgctct ggtgaggcag cctctgttta aaccggcccc acatgaagag aacatccctt 1440 gagacgaatt tggagactgg gattagagcc tttggaggtc aaagcaaatt aagattttta 1500 tttaagaata aaagagtact ttgatcatga gaaaaaaaac aaaaaaaaaa aaaaaaaaaa 1560 aaaaaa 1566 2 413 PRT Homo sapiens 2 Met Ala Pro Pro Ser Val Phe Ala Glu Val Pro Gln Ala Gln Pro Val 1 5 10 15 Leu Val Phe Lys Leu Thr Ala Asp Phe Arg Glu Asp Pro Asp Pro Arg 20 25 30 Lys Val Asn Leu Gly Val Gly Ala Tyr Arg Thr Asp Asp Cys His Pro 35 40 45 Trp Val Leu Pro Val Val Lys Lys Val Glu Gln Lys Ile Ala Asn Asp 50 55 60 Asn Ser Leu Asn His Glu Tyr Leu Pro Ile Leu Gly Leu Ala Glu Phe 65 70 75 80 Arg Ser Cys Ala Ser Arg Leu Ala Leu Gly Asp Asp Ser Pro Ala Leu 85 90 95 Lys Glu Lys Arg Val Gly Gly Val Gln Ser Leu Gly Gly Thr Gly Ala 100 105 110 Leu Arg Ile Gly Ala Asp Phe Leu Ala Arg Trp Tyr Asn Gly Thr Asn 115 120 125 Asn Lys Asn Thr Pro Val Tyr Val Ser Ser Pro Thr Trp Glu Asn His 130 135 140 Asn Ala Val Phe Ser Ala Ala Gly Phe Lys Asp Ile Arg Ser Tyr Arg 145 150 155 160 Tyr Trp Asp Ala Glu Lys Arg Gly Leu Asp Leu Gln Gly Phe Leu Asn 165 170 175 Asp Leu Glu Asn Ala Pro Glu Phe Ser Ile Val Val Leu His Ala Cys 180 185 190 Ala His Asn Pro Thr Gly Ile Asp Pro Thr Pro Glu Gln Trp Lys Gln 195 200 205 Ile Ala Ser Val Met Lys His Arg Phe Leu Phe Pro Phe Phe Asp Ser 210 215 220 Ala Tyr Gln Gly Phe Ala Ser Gly Asn Leu Glu Arg Asp Ala Trp Ala 225 230 235 240 Ile Arg Tyr Phe Val Ser Glu Gly Phe Glu Phe Phe Cys Ala Gln Ser 245 250 255 Phe Ser Lys Asn Phe Gly Leu Tyr Asn Glu Arg Val Gly Asn Leu Thr 260 265 270 Val Val Gly Lys Glu Pro Glu Ser Ile Leu Gln Val Leu Ser Gln Met 275 280 285 Glu Lys Ile Val Arg Ile Thr Trp Ser Asn Pro Pro Ala Gln Gly Ala 290 295 300 Arg Ile Val Ala Ser Thr Leu Ser Asn Pro Glu Leu Phe Glu Glu Trp 305 310 315 320 Thr Gly Asn Val Lys Thr Met Ala Asp Arg Ile Leu Thr Met Arg Ser 325 330 335 Glu Leu Arg Ala Arg Leu Glu Ala Leu Lys Thr Pro Gly Thr Trp Asn 340 345 350 His Ile Thr Asp Gln Ile Gly Met Phe Ser Phe Thr Gly Leu Asn Pro 355 360 365 Lys Gln Val Glu Tyr Leu Val Asn Glu Lys His Ile Tyr Leu Leu Pro 370 375 380 Ser Gly Arg Ile Asn Val Ser Gly Leu Thr Thr Lys Asn Leu Asp Tyr 385 390 395 400 Val Ala Thr Ser Ile His Glu Ala Val Thr Lys Ile Gln 405 410 3 3533 DNA Homo sapiens 3 agcggaggag ccgggcgcgc ctgccacgca aaactaccgg gctggcaggg cggcgggcgc 60 ggtgcgcgat cccgggtggc ggcggcaacg gcggtggtga cggcggcgac tgcagcggcc 120 ggctctcacc tctcccctgt gcacccgcat ctcgccgcgc cgccgagcag ccagcagtcc 180 ccgggtcgcc cagcccacgc gcgcacggcc gagcccagcg cacaatagcg gcgacagcca 240 tggggaaacc ggcgaggaaa ggatgcgagt ggaagcgctt cctgaagaat aactgggtgt 300 tgctgtccac cgtggccgcg gtggtgctag gcattaccac aggagtcttg gttcgagaac 360 acagcaacct ctcaactcta gagaaattct actttgcttt tcctggagaa attctaatgc 420 ggatgctgaa actcatcatt ttgccattaa ttatatccag catgattaca ggtgttgctg 480 cactggattc caacgtatcc ggaaaaattg gtgtgcgcgc tgtcgtgtat tatttctgta 540 ccactctcat tgctgttatt ctaggtattg tgctggtggt gagcatcaag cctggtgtca 600 cccagaaagt gggtgaaatt gcgaggacag gcagcacccc tgaagtcagt acggtggatg 660 ccatgttaga tctcatcagg aatatgttcc ctgagaatct tgtccaggcc tgttttcagc 720 agtacaaaac taagcgtgaa gaagtgaagc ctcccagcga tccagagatg aacatgacag 780 aagagtcctt cacagctgtc atgacaactg caatttccaa gaacaaaaca aaggaataca 840 aaattgttgg catgtattca gatggcataa acgtcctggg cttgattgtc ttttgccttg 900 tctttggact tgtcattgga aaaatgggag aaaagggaca aattctggtg gatttcttca 960 atgctttgag tgatgcaacc atgaaaatcg ttcagatcat catgtgttat atgccactag 1020 gtattttgtt cctgattgct gggaagatca tagaagttga agactgggaa atattccgca 1080 agctgggcct ttacatggcc acagtcctga ctgggcttgc aatccactcc attgtaattc 1140 tcccgctgat atatttcata gtcgtacgaa agaacccttt ccgatttgcc atgggaatgg 1200 cccaggctct cctgacagct ctcatgatct cttccagttc agcaacactg cctgtcacct 1260 tccgctgtgc tgaagaaaat aaccaggtgg acaagaggat cactcgattc gtgttacccg 1320 ttggtgcaac aatcaacatg gatgggactg cgctctatga agcagtggca gcggtgttta 1380 ttgcacagtt gaatgacctg gacttgggca ttgggcagat catcaccatc agtatcacgg 1440 ccacatctgc cagcatcgga gctgctggcg tgccccaggc tggcctggtg accatggtga 1500 ttgtgctgag tgccgtgggc ctgcccgccg aggatgtcac cctgatcatt gctgtcgact 1560 ggctcctgga ccggttcagg accatggtca acgtccttgg tgatgctttt gggacgggca 1620 ttgtggaaaa gctctccaag aaggagctgg agcagatgga tgtttcatct gaagtcaaca 1680 ttgtgaatcc ctttgccttg gaatccacaa tccttgacaa cgaagactca gacaccaaga 1740 agtcttatgt caatggaggc tttgcagtag acaagtctga caccatctca ttcacccaga 1800 cctcacagtt ctagggcccc tggctgcaga tgactggaaa caaggaagga catttccgtg 1860 agagtcatct caaacactgc ttaaggaaaa gagaaacact aatggccaag tgtacatttg 1920 atttgatata cagacctcca gattattttc tatatttgga ttcacagcct ttgcgctctg 1980 ggttttggga tttgggtgtg gggtaagttg aagggaaatc aatttaaagg aaagttctat 2040 tatctgggtt ttagaaattc tataagagac aaagtttgga agtacataaa gtaataactg 2100 ttagaattag gtaatggata tgaaagagaa aatgctttct catgcataga caagtgtttt 2160 gggtttttaa aaaaaatatt ctgtcattgg ttacaaattt ttactcaggc tttctattgg 2220 catggatttc ctttgacctc tcactttttt ataaattata atgcatctaa accacctgtc 2280 cccagttaat gtgccaaaat gtcaattttt aacttatctc cagccaattt caaagaaaac 2340 agaccagcat agttctgcaa taacagtttt aagatgggca tagggtttgg aagaaagaga 2400 gaaggattct tttttcaatg tactgtattg ggacgctggt aactgttaac ccagtgttca 2460 gcatagagct atatatatat atatatgtat atatttatta ttttcatata atttgccaga 2520 cagagatcag aattgaaccg tcaatgtgaa ataaagagtt ctccttgtac ttgaataata 2580 accacgattc caacccaggt ctgctttggg gcttatcaga actcctttct aaggagcact 2640 agaatgagaa atcatgttgt tcgatcgttt cacatctgta tatcagctct aaagcagaga 2700 tgtattatgg tgatactcca aggtggcata gccattcatt tacaacttcc agatttgagc 2760 tgcctggagg gaatccatat cagctctgca taagattata tacaaagctg tcactcacaa 2820 aaggctggat gtgctttcat ccaactggaa ggctttattc ttccaagttc attcatactc 2880 aaagaggcca gtactttgcc atccttgcac tttctgttat cagggcccaa ataacagtgg 2940 caagctacca actaagttgt attttaataa agattccatg ggttgaacaa gccacgttgc 3000 agaaaaagag cttcccctaa cctgggttgt tgcagagtaa atcccacgac ataagctggt 3060 atcactggtt cgggggaaat agttccattc tatgactctt gtctcctcct ccaggaggac 3120 tgttctaact agtaatcttg gccctattca ttacatcctc tgcttgtcat tctgctaatt 3180 tatgaagata gtttattata gtctgtactt cagttctcat cttgtaaata atgcttaaca 3240 taaacttgta cttacactga aatccaaaat agtcatgttt ctgcagtatt ctgtagccaa 3300 cttaaacctg tgctttcatg tttaagaaat gagaaattgt gccaaagata gcagaagagt 3360 agataagtgc tcagtattga cgacctacat ctgaaatcta caacataatg atactgaatt 3420 gttatgtaaa catcataaat agtaaataat gattcaatgt gaattttaaa atgcaaatat 3480 tgctattgtt tataggaaat aaatctaaat ataaacgaaa aaaaaaaaaa aaa 3533 4 524 PRT Homo sapiens 4 Met Gly Lys Pro Ala Arg Lys Gly Cys Glu Trp Lys Arg Phe Leu Lys 1 5 10 15 Asn Asn Trp Val Leu Leu Ser Thr Val Ala Ala Val Val Leu Gly Ile 20 25 30 Thr Thr Gly Val Leu Val Arg Glu His Ser Asn Leu Ser Thr Leu Glu 35 40 45 Lys Phe Tyr Phe Ala Phe Pro Gly Glu Ile Leu Met Arg Met Leu Lys 50 55 60 Leu Ile Ile Leu Pro Leu Ile Ile Ser Ser Met Ile Thr Gly Val Ala 65 70 75 80 Ala Leu Asp Ser Asn Val Ser Gly Lys Ile Gly Val Arg Ala Val Val 85 90 95 Tyr Tyr Phe Cys Thr Thr Leu Ile Ala Val Ile Leu Gly Ile Val Leu 100 105 110 Val Val Ser Ile Lys Pro Gly Val Thr Gln Lys Val Gly Glu Ile Ala 115 120 125 Arg Thr Gly Ser Thr Pro Glu Val Ser Thr Val Asp Ala Met Leu Asp 130 135 140 Leu Ile Arg Asn Met Phe Pro Glu Asn Leu Val Gln Ala Cys Phe Gln 145 150 155 160 Gln Tyr Lys Thr Lys Arg Glu Glu Val Lys Pro Pro Ser Asp Pro Glu 165 170 175 Met Asn Met Thr Glu Glu Ser Phe Thr Ala Val Met Thr Thr Ala Ile 180 185 190 Ser Lys Asn Lys Thr Lys Glu Tyr Lys Ile Val Gly Met Tyr Ser Asp 195 200 205 Gly Ile Asn Val Leu Gly Leu Ile Val Phe Cys Leu Val Phe Gly Leu 210 215 220 Val Ile Gly Lys Met Gly Glu Lys Gly Gln Ile Leu Val Asp Phe Phe 225 230 235 240 Asn Ala Leu Ser Asp Ala Thr Met Lys Ile Val Gln Ile Ile Met Cys 245 250 255 Tyr Met Pro Leu Gly Ile Leu Phe Leu Ile Ala Gly Lys Ile Ile Glu 260 265 270 Val Glu Asp Trp Glu Ile Phe Arg Lys Leu Gly Leu Tyr Met Ala Thr 275 280 285 Val Leu Thr Gly Leu Ala Ile His Ser Ile Val Ile Leu Pro Leu Ile 290 295 300 Tyr Phe Ile Val Val Arg Lys Asn Pro Phe Arg Phe Ala Met Gly Met 305 310 315 320 Ala Gln Ala Leu Leu Thr Ala Leu Met Ile Ser Ser Ser Ser Ala Thr 325 330 335 Leu Pro Val Thr Phe Arg Cys Ala Glu Glu Asn Asn Gln Val Asp Lys 340 345 350 Arg Ile Thr Arg Phe Val Leu Pro Val Gly Ala Thr Ile Asn Met Asp 355 360 365 Gly Thr Ala Leu Tyr Glu Ala Val Ala Ala Val Phe Ile Ala Gln Leu 370 375 380 Asn Asp Leu Asp Leu Gly Ile Gly Gln Ile Ile Thr Ile Ser Ile Thr 385 390 395 400 Ala Thr Ser Ala Ser Ile Gly Ala Ala Gly Val Pro Gln Ala Gly Leu 405 410 415 Val Thr Met Val Ile Val Leu Ser Ala Val Gly Leu Pro Ala Glu Asp 420 425 430 Val Thr Leu Ile Ile Ala Val Asp Trp Leu Leu Asp Arg Phe Arg Thr 435 440 445 Met Val Asn Val Leu Gly Asp Ala Phe Gly Thr Gly Ile Val Glu Lys 450 455 460 Leu Ser Lys Lys Glu Leu Glu Gln Met Asp Val Ser Ser Glu Val Asn 465 470 475 480 Ile Val Asn Pro Phe Ala Leu Glu Ser Thr Ile Leu Asp Asn Glu Asp 485 490 495 Ser Asp Thr Lys Lys Ser Tyr Val Asn Gly Gly Phe Ala Val Asp Lys 500 505 510 Ser Asp Thr Ile Ser Phe Thr Gln Thr Ser Gln Phe 515 520 5 11692 DNA Homo sapiens 5 caccctcgga gcccccggag ctccccgcca agcgccatcc ccgcgggcgg aggggagcgc 60 gggtcgcgcg ccgtggagag ccgggacgcg gattagcgcc cgcaggagcc tcctgcgccc 120 gttgaggcgc taaagggctt accccggagg cgggtggaag ggcgggcaga ggctcctctt 180 aaataccgct cccggccgca cttcgcgctc accccggcgt ccgctttctc cctcgcccac 240 agctgccgga tagtgctgaa gaggaggggg cgttccccag accatggcat ctacggaagg 300 tgccaacaat atgcccaagc aggtggaagt gcgaatgcac gacagtcatc ttggctcaga 360 ggaacccaag caccggcacc tgggcctgcg cctgtgtgac aagctgggga agaatctgct 420 gctcaccctg acggtgtttg gtgtcatcct gggagcagtg tgtggagggc ttcttcgctt 480 ggcatctccc atccaccctg atgtggttat gttaatagcc ttcccagggg atatactcat 540 gaggatgcta aaaatgctca ttctccctct aatcatctcc agcttaatca cagggttgtc 600 aggcctggat gctaaggcta gtggccgctt gggcacgaga gccatggtgt attacatgtc 660 cacgaccatc attgctgcag tactgggggt cattctggtc ttggctatcc atccaggcaa 720 tcccaagctc aagaagcagc tggggcctgg gaagaagaat gatgaagtgt ccagcctgga 780 tgccttcctg gaccttattc gaaatctctt ccctgaaaac cttgtccaag cctgctttca 840 acagattcaa acagtgacga agaaagtcct ggttgcacca ccgccagacg aggaggccaa 900 cgcaaccagc gctgttgtct ctctgttgaa cgagactgtg actgaggtgc cggaggagac 960 taagatggtt atcaagaagg gcctggagtt caaggatggg atgaacgtct taggtctgat 1020 agggtttttc attgcttttg gcatcgctat ggggaagatg ggagatcagg ccaagctgat 1080 ggtggatttc ttcaacattt tgaatgagat tgtaatgaag ttagtgatca tgatcatgtg 1140 gtactctccc ctgggtatcg cctgcctgat ctgtggaaag atcattgcaa tcaaggactt 1200 agaagtggtt gctaggcaac tggggatgta catggtaaca gtgatcatag gcctcatcat 1260 ccacgggggc atctttctcc ccttgattta ctttgtagtg accaggaaaa accccttctc 1320 cttttttgct ggcattttcc aagcttggat cactgccctg ggcaccgctt ccagtgctgg 1380 aactttgcct gtcacctttc gttgcctgga agaaaatctg gggattgata agcgtgtgac 1440 tagattcgtc cttcctgttg gagcaaccat taacatggat ggtacagccc tttatgaagc 1500 ggtggccgcc atctttatag cccaaatgaa tggtgttgtc ctggatggag gacagattgt 1560 gactgtaagc ctcacagcca ccctggcaag cgtcggcgcg gccagtatcc ccagtgccgg 1620 gctggtcacc atgctcctca ttctgacagc cgtgggcctg ccaacagagg acatcagcct 1680 gctggtggct gtggactggc tgctggacag gatgagaact tcagtcaatg ttgtgggtga 1740 ctcttttggg gctgggatag tctatcacct ctccaagtct gagctggata ccattgactc 1800 ccagcatcga gtgcatgaag atattgaaat gaccaagact caatccattt atgatgacat 1860 gaagaaccac agggaaagca actctaatca atgtgtctat gctgcacaca actctgtcat 1920 agtagatgaa tgcaaggtaa ctctggcagc caatggaaag tcagccgact gcagtgttga 1980 ggaagaacct tggaaacgtg agaaataagg atatgagtct cagcaaattc ttgaataaac 2040 tccccagcgt atcctatggt aactgatggt ataaacaagc tttctttaaa aaggaaaaaa 2100 atgcgtatat ttctatgttt acttaatctg ttagccgagg cttagaggag ctcttctgag 2160 tcagtgatga caggcacggt gctgtgtctt tgccaaataa tgctttataa ccgtctaatt 2220 ttctcacttg tattattatt tgaatggatg ctgctggagg aatcagttgg aattgaagac 2280 acgttcttgc cagcttccct tttctcccaa gatgcagaaa tgtggatgct cttttcccag 2340 gggacatgag taaagcagtg tggtacactc cagggacttg ggaaaatgag caaacacaca 2400 gcgtgttatt ccttaaagtg ttctccatgt ctcgccttgt tatgcacaag agattctatt 2460 aaaagcctct agaagtaact ccccttaaaa tgtctagtaa agcttgcaca tggattgatt 2520 aaaagcaaat acctgtctta gggaattctg acaatttatc ttccatatgc tcttcaagta 2580 aaatgtttca aagacagttt taaagggagc catgctctta aaggcagttg attaaagaac 2640 ctgttacatc tctgccttac cctgtgtaat ctgtgagaac aatggttgaa atttcaaagt 2700 atgtttcatt attctttcta aatttgataa ttgattagga agtattttat atataaccac 2760 tgtagatatt gacaaaagta agagagcaca gtcaacataa agtttaacca gagttaaata 2820 ttcaaattta tttatgttcg gtttgccttc acctggtgta gtaaaatcaa atgagattat 2880 ttggtatgtg ttttgctttg tttaacccaa aagattattt tttggttcct aaagaataag 2940 atatttttaa tctgtcaatt atttcagtca tcatctcatc gtcctagaaa gcctggtcat 3000 tttactgtcc tcacacatgg aagacacttg tgttggctgt gcctctttta atgtcatctc 3060 ttgatgaaac aaaaatactg gtatttatcc aatatgttga aagcgttttg ttatcttcta 3120 tcaaagcaca taaaaccagt tataggacac agttggaagc agagccagtc ttcctcctca 3180 gtcttctcag tagaaaggaa cagaaaacaa acatctgtgc aaaaccctat aaaagtgata 3240 ttcctatggc agtgtccagg cagccttgaa caatgacatg gcaacaaagg atggctctgt 3300 acaacttcac agatggaatc ttgtttaagg ctgtgaagtt ttaaggaaaa aataatataa 3360 ttggtgtagc acagtgcctt gcatgcagta ggcagccaac aaatccttaa tgattggtaa 3420 tgcaagctaa aatttttaat gtctctacta tcctttttta ttgaataacg cacagtattt 3480 tcctaaggtt gctgggtttg ggagtaagct tgggacttga atcattgaga aataaagcac 3540 atttgactct aagaagtatg tagaaccttg tttataggtg agaataggct gtattacatg 3600 aaaccagcgg agggatgctg agacagtcat ggtttataga atccaaattc tctgagatga 3660 gggtgcttat ggacctcaaa tagcctttct tatctgctcc ttgtgacttc tctcttttaa 3720 atcagatacc tttgaaacat acaagagaca tgagattcca gttgtttttg tctcccaaac 3780 cctagtgatt tttggataag gatcactact gaacagtatt aggcaacttt tcatttttgc 3840 ttttcaaatg acattggtct ttatttccag tagatgggca gtaaacaggg tcagggtggt 3900 caagggatat gattaagtgt agtagtagaa aatgcattct tttttttttt ttttagacag 3960 agtcttgctc tgttgctcag gctggagtac agtggcacga tctcagctca ctgcactgca 4020 atgtccacct ttctagttca agcgattctc ctgtctcagc cacccgagta gctgggacta 4080 caggcacatg ccaccatgcc tgggtaattt tttgtatttt tagtacagat gggtttcatt 4140 atgttagcca ggatgggctc gatctcctga cctcatgatc cacccgccat ggcctcccaa 4200 agtgctggga ttacaggcgt gagccaccat gcccggccag aaaatacatt ctttacagca 4260 aatgttatag gttaaggaga cagttcctgg gtttagctgt gaaactgttt aggattttct 4320 ctaagtggtg ttcccaagag cttttttctg gtaagttgga ggctataggc aaagcgtcta 4380 ggagcttgtt gttttttgct atggaaggcc agctttggga tctatataca tgggcatgtt 4440 tctcaatgga gtggtcacag gattctcttc tcagaactcc gcctgaaacc tttccggtgt 4500 gttttgtctt caactttttc ctagctatcc tattagcctg taaatatgtc agcagacaat 4560 gtgtttggaa gaaacaaggt gtacaacttc acaaagcaag ttttttctgg tctgagactt 4620 actgtgtgca ggaagtctcg tgagaatata cttagaaaat cacacttatc tggaaattca 4680 ataagtcata ttaaaaatat acaatttctt gaactcagga ctagcctgaa aatccaaaat 4740 gcatgtagag ttgacatgtt gacatatgtc tatgtcctct gtttttatgt aagcctcatt 4800 tatttctccc ttgccatgcc catttgggat tttctttgcc cagcatatgt gccataaaca 4860 gaacaaatgg atccagggta tacctctaat aacctttatc taaatagaat acatttaacc 4920 tgtcaattat atgtatagat ttatcctaat tggttcaaca tatatatgaa tgcttgtaat 4980 agtctgttac tatttgataa tgcataaatg ttatatttat atacagttac agcaggccag 5040 aaacatacat acactgcatc ctgtaaggtc tgtaaatcag ggcttcggca atgccatttt 5100 gctatagact cattcatgtt ttttaggaat cagaatcatt tcaggagaaa accaactaag 5160 tagactttaa ttcttctctt tctcattctt gccctttgag aaacccaagt ggaatcatca 5220 cttaattact tactcaaatt caatgaatca aatccaatcc aatcagtgag aagaatagct 5280 tatctttcaa ggtaatactt taccttagat ctttgagggc atgcttccca tacaaagagc 5340 atcagaaaat cagggcaatc atctagatct cacagatctt caacatagga ctctaggtta 5400 tgagagctga accacatagc tatcagtttc agtttttact tcctattgag tgtgtagaaa 5460 cacagcctat ggttttgctg tctgcagaag ttaccatacc ctgtaaaaga caaatatttc 5520 tttctcctct cgaaccacta gtcagcctgg atattcatga aagctgaaag aaggggtgcc 5580 aaaatgccta gctcctggat ttagactcat ggaatatgag aaaggcttag ggaaaattca 5640 agtcatgggg agattcctgc taggtgatgc caggaatctc aatgacctgt tgattagagg 5700 gtctctggtc atttccactt gtgtgtggag aaatacatca tttgcatatg tatctagtca 5760 agaagccctt ttggagcaaa gtggttggga aactcattta ctgggtccac caaaggaaac 5820 ccaattctgc catggttttg caaatactac acgtcccttt gctctcaata gcaaaagtga 5880 tgttatagtt ttttttactt ttgtcagaat caaggcttcg gccacctgtg caatattttc 5940 ccatggagtg cagcagtaaa tccaagtagg tatatccgtg ctcagagaga attctccata 6000 gttctgactc ctgaaaaaca agtctcatta gctctcatct agtttgatga ctaaaagtgc 6060 acagcagtta taccccagtg gaagtgattt ttctcaccta gctgctctct aactcttacg 6120 gctatctagt tgtctgagct gctggtttgt tttacttatt ctccaactga actcattcca 6180 aaccaagaaa aaggaccaaa tgaaattttc agttattatc atacttgctc cttgaataag 6240 aatgctagag tctggaggca gaagaagact aagttaaaaa aacagcctga aggaattctg 6300 gagtgagttt tgtaagtcta attgtcaaag aagataaatg gtctcagtat ctgtagtaat 6360 ttctgggtaa gcctagaggg atcactgagt gccaatccaa gcaggtggca ctagaggcag 6420 attgttctcc tagcatagag tgagtttcca gctgctcatg acacattttc ttatgatttt 6480 ccctaaaaga aaaccaaatt ttctactcca cctcttcctt tcctatcatt ccacttattg 6540 ttgctgcttt gctaacagcc agcttagtaa gccatcacac aagttagtgt gaggcttggt 6600 tttaaaatag ataacagaga atcttgatta aatagtatcc aactttgtag cataacacag 6660 ctaattcagg caatgacata gagatgataa gaaacaacat ggtttggtag agggaacatt 6720 tgatttagac tctgcccatt tttagctgta tgacttacat aagtcatttt gtgtccaagc 6780 ctcattttct cccatatgaa aagtgaaggg gttggattaa atgactaaaa tccccttcca 6840 gccctatgag cccaatgtat tatgatctct gctttgtttc cttcttaaga ggcttcctac 6900 tataaaatgt gacctattta cattttaagt tgaagtagcc cacaataatg aataatcaat 6960 ttagattttt ctcatctcct ttgggagaaa ttaaattcaa gcctctattc atttgatgtt 7020 ttacaacaag cttcaaagtt ggccatgttc attcacagtt tgatattttg agacaccaat 7080 aaaagttttt taataaaagt tcccttgact taaactcacc ttcctaatag aagtagtgat 7140 ttggctcacg gaaaaatgtt tccagagtca acatgagagg actggatgaa cctacagcct 7200 cactcaagct gttgcatcat tgctgctgta gcaacagagc cctctctgaa tacccaaaac 7260 acacaatttt ctccaacaat tctaattgcc cacccagtat gggaatttga gagcctttgg 7320 tgaataacct tgatcactct tttgaaattt ggtgcaacac ttgcacaatc acggcaactg 7380 tttgtaaata tcttttcttc tccgccagta tgtagacatg cacacccatt cacaagtaag 7440 cccgcaaatt cattttcaca ttttcctctt ggtttgtttt tatcatgggt atcacattgc 7500 aggaagagat gctctaactt acaagaatgg aattttctga ctatcattgc cctcttcaag 7560 gcctgattag tttgtagggt ggatgggata caatgcatgg tctacaaggt ccatgtcacc 7620 agccatgatc tttaaacaac ctcttgcact ttctggcact gctgcaatgg cccatggtgg 7680 acacctctgc tgtttgtcca aacttctgct aaatagagta gtgttggtat ttgttgtgtt 7740 gtgaactcct ttgttgtcag agaagacttc agctcttcag gtttgcaagg gcaagatgta 7800 aagctaggac tgaaaggatt gggttttgtt tttcggccgg cttttttcca ttagtgcaaa 7860 aaacctgatg caagggcagc aagataaaca caatttcttc attgacccag ataagaacct 7920 gtagaaaagt gagagatgcc cctaagttcc tatttcattt atcctaaata atatcctgaa 7980 taactactga ttagtttttt tctatgtcca gccttgcttc actttccaac ctccaggcca 8040 aatattcatt ttgatatttt acaaaatcct tctggttttt acacagggcc cttttgtgct 8100 gagaataaaa ccagctattg gaacagacct cactgctttg tagagtcaga agttgaccta 8160 gagaatgggt agcaacaggt cctcagtttc tctggagaca cccccaatcc ttgggtgagt 8220 gttgaaaccc gacaactgac tgttagcctt gtcatcatcc tcactggaga tggagatgtt 8280 gcagtcggag cctcactcct tctggggcac tgtttcttct catggcaacc attttgcctg 8340 cctggaaaca gcctgttgcc tttgctaggg ggcaaataca cacactagcc aaggaagttg 8400 atccagcaca tctctgaggt ttctccccca tggacccatg gagcgaattc cctgagataa 8460 atgatcagaa agttggctgt ctggtggttt aatactccat aagagttact tcccttatag 8520 aaagtcatcc tttggcatct agatttttct acaaatcacc ttaactccct ccctttctct 8580 ttctcctgga aaacctcagc attgcatctc catgttgcac aacacagatc agattatctg 8640 gtcactatag agattttgta tataaaaaaa ctactttttt gaggattttg gatattgggt 8700 tttctatttg ttttccacag cttgaggaaa gagctggcaa atctgtgaat ctaacttgat 8760 cttgttgcca gcagatatat tttggcttcc tggtaagagt ctgtgttccc agggacaata 8820 tattgccctc tgatcagtat tgcaccctaa atccacaaat tcctccagcc cagccttaca 8880 tttttttaaa caaatcccct agcttgtttg ctattattta taaagagcat tagaaaatgt 8940 atttatagac ctggatgtta tatgtttaat attaatttag cctttaataa tgttataggt 9000 ttgtaactat tcttcagaaa ttataaagaa ctcagtgtag actgaattaa tctatttagt 9060 atctgtaatt ttgcagacat attttcttac agtattttct atgtaaccac acatgtagaa 9120 ttataactaa ttagagcaca agaagtttct tcagcaattt agagctacca attgtttctt 9180 gactatatag tcacatttga caaatttaag aacccagtct ttagtgatgc aataaacaaa 9240 atgaaccatt aagaacaagg aattgcttaa atcccttagc tggtgaggat atacatctaa 9300 ataattcatc tttctaactc aaggaatggt gctgattttt taaatgtttg acaccaggcc 9360 ttgtttttcc agctgagcat tctcattttg cttttctcta agactatcaa agacaaggta 9420 ttaatagtag gattacttcc tagatcagaa tgtttcatac attcctaaag gtttatgtgg 9480 aaattggctt aggaaaactt tgagtagcag agtctgagga tgagtgctag agatgaaatc 9540 aggacagatt tgttgcggtt aattcttgcc aagcaaatta gtggtaaatg tcacgttgtt 9600 atgtgaattg agcacacatt tttaaagaaa gtttacaaaa aatttttaga accaactatg 9660 aggcaatact gtatcactgg gggctgggag tgggggctta gaatcatact gaaattgttt 9720 aaaagcagcc caggtagttt ctgtcctcag gtgaataaca gactatataa cttccccgaa 9780 aggtaaaacg atagccactg cagagacgag gtgtcttcct tccaccaaat actttctgag 9840 atggtggtag gagtagtatt tttaccgtgg tttaaaaagt agtcagttac atagaaagtg 9900 tattggtatg tatttgaacc tgctcttcat taaacaagca gattagatgt accctcgact 9960 ggcaataatt gtatctattt tcaagtacag ctagctgtca aagcatgaag ctcttgtgta 10020 tacacactga cacttggttc acgcatgaag aacagtgcct atgcactttg tgtagctata 10080 atgtaggtat ctaggtgtaa tttcagtgaa atggtgtata gatgtattgt aatttaaatg 10140 tatatgttat ttttggctat tcatctaaat gcagtagaca tgttgatcgg tgttttgcaa 10200 acatttcttt ttccttctta gataactagt cgtgaatcat ttctcctctt tctcagtgtg 10260 gcttgggaat atatatgagt gaagaattta tctgtgaatc ctttgtactg atgattgttt 10320 gaaagtctgt gtgtgtccag cacctttgta aatacgcaat tcagagcagg gatgggctgg 10380 gtgtgtgtcc tggttcttag tgaaaggtca tctcatgtct gtttaataca tggtgaatgc 10440 aactgtggaa cttttgatta cctagactta ggtaggttta gaatgagaac atccatctac 10500 agtcctcctt tgcttggtgg attgggctca gaggaacaaa aagttagtct gactctgtgc 10560 atattagcat catgtcttta gagaaaggtc agcctctctg gttgccaaat acccatcatg 10620 atgctcatga cttaaaggtt ctgaggagcc ttgtctccct tggatttttg agtcagggta 10680 caggaaaaaa cattgccgac taactaactg caaatgcatc tgcaggtgaa accctacgaa 10740 agcacagttc tggctataaa cttcagagtt ctctgtaaaa aacttagagc actagaagca 10800 caggaatagt gagtgtacag cttatgcggt tgtagagggg caactgatga acacaggtcc 10860 cacatatatg agggagtatg acgttctcta cctaatatgt tctgtgtgca tgttttgaat 10920 gattgaagat gggattaact aatgcaagtt tacagttgcc tcctaaaaca cacattctgt 10980 ataattatcg ctaatacaat gctgtgaggt ctatagttcc tgtaacccct ttctcctccc 11040 caaggacaga gaagaactag ccatgtgcta tagggaaccc tgagtgccct actcttttcc 11100 caagaagggt aaagcctaca atatcatcag ggggcatgaa gcacattaat ttgcagtggc 11160 tgcttcatat gaggagatat ggtggacagg ctaatttttc cttgaaaatg tggcttcttc 11220 aactcctttc aaatttagga tggaatactt cctgaaataa aactgggctt tatgcaggat 11280 tctctttgaa aattcttgta tgtccagaac aaaagataaa actaattgta ttcctcacat 11340 tcacaatccc cattggtctg aagtcacgta gcacagagca tctatagcac atagtgttta 11400 aagactaatg aatgcaaaaa gataaaatct tcaactaatt tttgaattgt ttctcatata 11460 tgctactaga aaatgccttg ttgatgaagc acattttggg tagttgaggt cttttgtttt 11520 cgcctttagc tttctaagct ttcttacaat gtggactgat tactgtaaca tttcacgtgt 11580 aaaataactg gatattcttt atatactgga aataacctgt gaatccaata tttcactaag 11640 tgttttaact tttgtgtata tatctctcat caataaatgt ggatttcaat tt 11692 6 574 PRT Homo sapiens 6 Met Ala Ser Thr Glu Gly Ala Asn Asn Met Pro Lys Gln Val Glu Val 1 5 10 15 Arg Met His Asp Ser His Leu Gly Ser Glu Glu Pro Lys His Arg His 20 25 30 Leu Gly Leu Arg Leu Cys Asp Lys Leu Gly Lys Asn Leu Leu Leu Thr 35 40 45 Leu Thr Val Phe Gly Val Ile Leu Gly Ala Val Cys Gly Gly Leu Leu 50 55 60 Arg Leu Ala Ser Pro Ile His Pro Asp Val Val Met Leu Ile Ala Phe 65 70 75 80 Pro Gly Asp Ile Leu Met Arg Met Leu Lys Met Leu Ile Leu Pro Leu 85 90 95 Ile Ile Ser Ser Leu Ile Thr Gly Leu Ser Gly Leu Asp Ala Lys Ala 100 105 110 Ser Gly Arg Leu Gly Thr Arg Ala Met Val Tyr Tyr Met Ser Thr Thr 115 120 125 Ile Ile Ala Ala Val Leu Gly Val Ile Leu Val Leu Ala Ile His Pro 130 135 140 Gly Asn Pro Lys Leu Lys Lys Gln Leu Gly Pro Gly Lys Lys Asn Asp 145 150 155 160 Glu Val Ser Ser Leu Asp Ala Phe Leu Asp Leu Ile Arg Asn Leu Phe 165 170 175 Pro Glu Asn Leu Val Gln Ala Cys Phe Gln Gln Ile Gln Thr Val Thr 180 185 190 Lys Lys Val Leu Val Ala Pro Pro Pro Asp Glu Glu Ala Asn Ala Thr 195 200 205 Ser Ala Val Val Ser Leu Leu Asn Glu Thr Val Thr Glu Val Pro Glu 210 215 220 Glu Thr Lys Met Val Ile Lys Lys Gly Leu Glu Phe Lys Asp Gly Met 225 230 235 240 Asn Val Leu Gly Leu Ile Gly Phe Phe Ile Ala Phe Gly Ile Ala Met 245 250 255 Gly Lys Met Gly Asp Gln Ala Lys Leu Met Val Asp Phe Phe Asn Ile 260 265 270 Leu Asn Glu Ile Val Met Lys Leu Val Ile Met Ile Met Trp Tyr Ser 275 280 285 Pro Leu Gly Ile Ala Cys Leu Ile Cys Gly Lys Ile Ile Ala Ile Lys 290 295 300 Asp Leu Glu Val Val Ala Arg Gln Leu Gly Met Tyr Met Val Thr Val 305 310 315 320 Ile Ile Gly Leu Ile Ile His Gly Gly Ile Phe Leu Pro Leu Ile Tyr 325 330 335 Phe Val Val Thr Arg Lys Asn Pro Phe Ser Phe Phe Ala Gly Ile Phe 340 345 350 Gln Ala Trp Ile Thr Ala Leu Gly Thr Ala Ser Ser Ala Gly Thr Leu 355 360 365 Pro Val Thr Phe Arg Cys Leu Glu Glu Asn Leu Gly Ile Asp Lys Arg 370 375 380 Val Thr Arg Phe Val Leu Pro Val Gly Ala Thr Ile Asn Met Asp Gly 385 390 395 400 Thr Ala Leu Tyr Glu Ala Val Ala Ala Ile Phe Ile Ala Gln Met Asn 405 410 415 Gly Val Val Leu Asp Gly Gly Gln Ile Val Thr Val Ser Leu Thr Ala 420 425 430 Thr Leu Ala Ser Val Gly Ala Ala Ser Ile Pro Ser Ala Gly Leu Val 435 440 445 Thr Met Leu Leu Ile Leu Thr Ala Val Gly Leu Pro Thr Glu Asp Ile 450 455 460 Ser Leu Leu Val Ala Val Asp Trp Leu Leu Asp Arg Met Arg Thr Ser 465 470 475 480 Val Asn Val Val Gly Asp Ser Phe Gly Ala Gly Ile Val Tyr His Leu 485 490 495 Ser Lys Ser Glu Leu Asp Thr Ile Asp Ser Gln His Arg Val His Glu 500 505 510 Asp Ile Glu Met Thr Lys Thr Gln Ser Ile Tyr Asp Asp Met Lys Asn 515 520 525 His Arg Glu Ser Asn Ser Asn Gln Cys Val Tyr Ala Ala His Asn Ser 530 535 540 Val Ile Val Asp Glu Cys Lys Val Thr Leu Ala Ala Asn Gly Lys Ser 545 550 555 560 Ala Asp Cys Ser Val Glu Glu Glu Pro Trp Lys Arg Glu Lys 565 570 7 3983 DNA Homo sapiens 7 ggtagtaact tgcagtttca gagcacatgc acactgtcag ggctagcctg cctgcttacg 60 cgcgctgcgg attgttgctc cgttgtacct gctggggaat tcacctcgtt actgctcgat 120 atcttccacc ccttacaaaa tcagaaaagt tgtgttttct aataccaaag aggaggtttg 180 gctttctgtg ggtgattccc agacactgaa gtgcaaagaa gagaccctcc tagaaaagta 240 aaatatgact aaaagcaatg gagaagagcc caagatgggg ggcaggatgg agagattcca 300 gcagggagtc agtaaacgca cacttttggc caagaagaaa gtgcagaaca ttacaaagga 360 ggatgttaaa agttacctgt ttcggaatgc ttttgtgctg ctcacagtca ccgctgtcat 420 tgtgggtaca atccttggat ttaccctccg accatacaga atgagctacc gggaagtcaa 480 gtacttctcc tttcctgggg aacttctgat gaggatgtta cagatgctgg tcttaccact 540 tatcatctcc agtcttgtca caggaatggc ggcgctagat agtaaggcat cagggaagat 600 gggaatgcga gctgtagtct attatatgac taccaccatc attgctgtgg tgattggcat 660 aatcattgtc atcatcatcc atcctgggaa gggcacaaag gaaaacatgc acagagaagg 720 caaaattgta cgagtgacag ctgcagatgc cttcctggac ttgatcagga acatgttccc 780 tccaaatctg gtagaagcct gctttaaaca gtttaaaacc aactatgaga agagaagctt 840 taaagtgccc atccaggcca acgaaacgct tgtgggtgct gtgataaaca atgtgtctga 900 ggccatggag actcttaccc gaatcacaga ggagctggtc ccagttccag gatctgtgaa 960 tggagtcaat gccctgggtc tagttgtctt ctccatgtgc ttcggttttg tgattggaaa 1020 catgaaggaa caggggcagg ccctgagaga gttctttgat tctcttaacg aagccatcat 1080 gagactggta gcagtaataa tgtggtatgc ccccgtgggt attctcttcc tgattgctgg 1140 gaagattgtg gagatggaag acatgggtgt gattgggggg cagcttgcca tgtacaccgt 1200 gactgtcatt gttggcttac tcattcacgc agtcatcgtc ttgccactcc tctacttctt 1260 ggtaacacgg aaaaaccctt gggtttttat tggagggttg ctgcaagcac tcatcaccgc 1320 tctggggacc tcttcaagtt ctgccaccct acccatcacc ttcaagtgcc tggaagagaa 1380 caatggcgtg gacaagcgcg tcaccagatt cgtgctcccc gtaggagcca ccattaacat 1440 ggatgggact gccctctatg aggctttggc tgccattttc attgctcaag ttaacaactt 1500 tgaactgaac ttcggacaaa ttattacaat cagcatcaca gccacagctg ccagtattgg 1560 ggcagctgga attcctcagg cgggcctggt cactatggtc attgtgctga catctgtcgg 1620 cctgcccact gacgacatca cgctcatcat cgcggtggac tggttcctgg atcgcctccg 1680 gaccaccaca aacgtactgg gagactccct gggagctggg attgtggagc acttgtcacg 1740 acatgaactg aagaacagag atgttgaaat gggtaactca gtgattgaag agaatgaaat 1800 gaagaaacca tatcaactga ttgcacagga caatgaaact gagaaaccca tcgacagtga 1860 aaccaagatg tagactaaca taaagaaaca ctttcttgag caccaggtgt taaaaaccat 1920 tataaaatct ttccatctca ttacagctca ttcgctccag caagcccgtc atcttccctt 1980 tcctcccttc tgataagact ggaaaatagt cctccaaaac acaagggagg attttgggtg 2040 gccaaagtgt acaattttca tcccacaatt gaaattttta aatcatttca tgttagtctt 2100 accgaataag gtaccaagat cacaaatagt gttgatcaga tcttacaagt ttatgtggca 2160 cacaatccta taaatgtgat ttttttatat aagttaaaga gacaaatagt aggctaaaaa 2220 cattttaaaa tcaacttttg aaatttaaaa atctttcaga atacaattca gttttagttt 2280 caaaatgtta acaacttgaa ttacaaccgg ttatcagttg gacagtaaga ttttatccct 2340 ttctcttctg actggtatac ctatttcatt agtagctagg tgcacatata catctagcac 2400 agctgtgagg acagacagaa ggcaaagttt ccatgtggcc ttgagcaagt cccatctcac 2460 ctctaggcct cagtgtcctc atctataaaa tgagggactt ccctagaagt cttcatggtc 2520 tcttccagcc cagacatcct gtgatgtcat gaaagcacct gccctctgtt tcccctcaga 2580 acaccctgta ccatccatgg agcacgaggc cttcagaaaa gacacttcaa tgggagtgaa 2640 catttctaac taaggacagg atggctgtgt gtggtggtca ccaggtcctg tgagcaaagt 2700 gcaggttatg caagtcgcca ggcaggaggc cattccagga gtgggattat tcatcaaact 2760 ctttgcccag ttcatcccaa tgggggaagt attcccttct ttcctactct gggaagaatg 2820 tctcctgcca ctcctcaact gatgatagac ttcgaaaaca gatgagaaga ctagcagcta 2880 gcaagggtgc ttgtagtcac actgtggaac actaaagagc taggaaagag ttgagcacag 2940 gcaacattac aaacaaagga tttgaaaaca ccaagagtac aggtcttctt taaggaagaa 3000 taaaaaagaa gaggttcatt tttctggctt tttttttcac ctgaaacact ttttctcgag 3060 tccaaaatca ttccccccgt gaagtctgct taccaaaaca taagacgact tatatatttg 3120 aaagaagtca aatgaatgag ctctctaata gaagtccatg agttgagtgg gtatttctta 3180 tttgaaagtg tttttcttta atcaaaagtc cttagaatga gggaaacaaa atatttattt 3240 gttttggaat cccacttatc aaatcattca aaactttcag ctggagtggg gtttgctttt 3300 gttttgtttg tgtccataag agaaatggta gaagatgaat cagtatgaag acactgtcaa 3360 tgaggttatg agaaaaacag caggggcatt agtttcaggc aaggcagctc ccaggtttag 3420 agattaattt ttacccccta aggaatatcc agtcaaagac gctgagtggg agctgtcagg 3480 cagtagcagc tgtgtttgag tttctggctg aaaatggtga agaatggact taatcatgct 3540 aacaaactga aaaatctaga catagatcct ctgatataca attagagata tttttatata 3600 gaccccaagc attctgtgca taaaagttaa cattaggctg tggtgcagta accatttaat 3660 gtcgaggctc tatttcggaa atacactaca aatgttaaag tacgtggctg tcctcttaag 3720 acactagtag agcaaagact taatcatatc aacttaattc tgttacacaa tatgtgtttt 3780 tttaatatac taaccatttc ttatggaaag gtcctgtggg gagcccatca tctcgccaag 3840 ccatcacagg ctctgcatac acatgcactc agtgtggact gggaagcatt actttgtaga 3900 tgtattttca ataaagaaaa aaatagtttt acattaaaaa aaaaaaaaaa aaaaaaaaaa 3960 aaaaaagaaa aaaaaaaaaa aaa 3983 8 542 PRT Homo sapiens 8 Met Thr Lys Ser Asn Gly Glu Glu Pro Lys Met Gly Gly Arg Met Glu 1 5 10 15 Arg Phe Gln Gln Gly Val Ser Lys Arg Thr Leu Leu Ala Lys Lys Lys 20 25 30 Val Gln Asn Ile Thr Lys Glu Asp Val Lys Ser Tyr Leu Phe Arg Asn 35 40 45 Ala Phe Val Leu Leu Thr Val Thr Ala Val Ile Val Gly Thr Ile Leu 50 55 60 Gly Phe Thr Leu Arg Pro Tyr Arg Met Ser Tyr Arg Glu Val Lys Tyr 65 70 75 80 Phe Ser Phe Pro Gly Glu Leu Leu Met Arg Met Leu Gln Met Leu Val 85 90 95 Leu Pro Leu Ile Ile Ser Ser Leu Val Thr Gly Met Ala Ala Leu Asp 100 105 110 Ser Lys Ala Ser Gly Lys Met Gly Met Arg Ala Val Val Tyr Tyr Met 115 120 125 Thr Thr Thr Ile Ile Ala Val Val Ile Gly Ile Ile Ile Val Ile Ile 130 135 140 Ile His Pro Gly Lys Gly Thr Lys Glu Asn Met His Arg Glu Gly Lys 145 150 155 160 Ile Val Arg Val Thr Ala Ala Asp Ala Phe Leu Asp Leu Ile Arg Asn 165 170 175 Met Phe Pro Pro Asn Leu Val Glu Ala Cys Phe Lys Gln Phe Lys Thr 180 185 190 Asn Tyr Glu Lys Arg Ser Phe Lys Val Pro Ile Gln Ala Asn Glu Thr 195 200 205 Leu Val Gly Ala Val Ile Asn Asn Val Ser Glu Ala Met Glu Thr Leu 210 215 220 Thr Arg Ile Thr Glu Glu Leu Val Pro Val Pro Gly Ser Val Asn Gly 225 230 235 240 Val Asn Ala Leu Gly Leu Val Val Phe Ser Met Cys Phe Gly Phe Val 245 250 255 Ile Gly Asn Met Lys Glu Gln Gly Gln Ala Leu Arg Glu Phe Phe Asp 260 265 270 Ser Leu Asn Glu Ala Ile Met Arg Leu Val Ala Val Ile Met Trp Tyr 275 280 285 Ala Pro Val Gly Ile Leu Phe Leu Ile Ala Gly Lys Ile Val Glu Met 290 295 300 Glu Asp Met Gly Val Ile Gly Gly Gln Leu Ala Met Tyr Thr Val Thr 305 310 315 320 Val Ile Val Gly Leu Leu Ile His Ala Val Ile Val Leu Pro Leu Leu 325 330 335 Tyr Phe Leu Val Thr Arg Lys Asn Pro Trp Val Phe Ile Gly Gly Leu 340 345 350 Leu Gln Ala Leu Ile Thr Ala Leu Gly Thr Ser Ser Ser Ser Ala Thr 355 360 365 Leu Pro Ile Thr Phe Lys Cys Leu Glu Glu Asn Asn Gly Val Asp Lys 370 375 380 Arg Val Thr Arg Phe Val Leu Pro Val Gly Ala Thr Ile Asn Met Asp 385 390 395 400 Gly Thr Ala Leu Tyr Glu Ala Leu Ala Ala Ile Phe Ile Ala Gln Val 405 410 415 Asn Asn Phe Glu Leu Asn Phe Gly Gln Ile Ile Thr Ile Ser Ile Thr 420 425 430 Ala Thr Ala Ala Ser Ile Gly Ala Ala Gly Ile Pro Gln Ala Gly Leu 435 440 445 Val Thr Met Val Ile Val Leu Thr Ser Val Gly Leu Pro Thr Asp Asp 450 455 460 Ile Thr Leu Ile Ile Ala Val Asp Trp Phe Leu Asp Arg Leu Arg Thr 465 470 475 480 Thr Thr Asn Val Leu Gly Asp Ser Leu Gly Ala Gly Ile Val Glu His 485 490 495 Leu Ser Arg His Glu Leu Lys Asn Arg Asp Val Glu Met Gly Asn Ser 500 505 510 Val Ile Glu Glu Asn Glu Met Lys Lys Pro Tyr Gln Leu Ile Ala Gln 515 520 525 Asp Asn Glu Thr Glu Lys Pro Ile Asp Ser Glu Thr Lys Met 530 535 540 9 1719 DNA Homo sapiens 9 gatagaccat gagcagccat ggcaacagcc tgttccttcg ggagagcggc cagcggctgg 60 gccgggtggg ctggctgcag cggctgcagg aaagcctgca gcagagagca ctgcgcacgc 120 gcctgcgcct gcagaccatg accctcgagc acgtgctgcg cttcctgcgc cgaaacgcct 180 tcattctgct gacggtcagc gccgtggtca ttggggtcag cctggccttt gccctgcgcc 240 catatcagct cacctaccgc cagatcaagt acttctcttt tcctggagag cttctgatga 300 ggatgctgca gatgctggtg ttacctctca ttgtctccag cctggtcaca ggtatggcat 360 ccctggacaa caaggccacg gggcggatgg ggatgcgggc agctgtgtac tacatggtga 420 ccaccatcat cgcggtcttc atcggcatcc tcatggtcac catcatccat cccgggaagg 480 gctccaagga ggggctgcac cgggagggcc ggatcgagac catccccaca gctgatgcct 540 tcatggacct gatcagaaat atgtttccac caaaccttgt ggaggcctgc ttcaaacagt 600 tcaagacgca gtacagcacg agggtggtaa ccaggaccat ggtgaggaca gagaacgggt 660 ctgagccggg tgcctccatg cctcctccat tctcagtgga gaacggaacc agcttcctgg 720 aaaatgtcac tcgggccttg ggtaccctgc aggagatgct gagctttgag gagactgtac 780 ccgtgcctgg ctccgccaat ggcatcaacg ccctgggcct cgtggtcttc tctgtggcct 840 ttgggctggt cattggtggc atgaaacaca agggcagagt cctcagggac ttcttcgaca 900 gcctcaatga ggctattatg aggctggtgg gcatcattat ctggtatgca cctgtgggca 960 tcctgttcct gattgctggg aagattctgg agatggaaga catggccgtc ctggggggtc 1020 agctgggcat gtacaccctg accgtcatcg tgggcctgtt cctccatgcc ggcattgtcc 1080 ttcccctcat ctacttcctc gtcactcacc ggaacccctt ccccttcatt gggggcatgc 1140 tacaagccct catcaccgct atgggcacgt cttccagctc ggcaacgctg cccatcacct 1200 tccgctgcct ggaggagggc ctgggtgtgg accgccgcat caccaggttc gtcctgcccg 1260 tgggcgccac ggtcaacatg gatggcactg ccctctacga ggccctggct gccatcttca 1320 ttgctcaagt taacaactac gagctcaacc tgggtcagat cacaaccatc agcatcacgg 1380 ccacagcagc cagtgttggg gctgctggca tcccccaggc gggtctggtc accatggtca 1440 ttgtgcttac gtcggtcggc ttgcccacgg aagacatcac gctcatcatc gccgtggact 1500 ggttccttga ccggcttcgc acaatgacca acgtactggg ggactcaatt ggagcggccg 1560 tcatcgagca cttgtctcag cgggagctgg agcttcagga agctgagctt accctcccca 1620 gcctggggaa accctacaag tccctcatgg cacaggagaa gggggcatcc cggggacggg 1680 gaggcaacga gagtgctatg tgaggggcct ccagctctg 1719 10 564 PRT Homo sapiens 10 Met Ser Ser His Gly Asn Ser Leu Phe Leu Arg Glu Ser Gly Gln Arg 1 5 10 15 Leu Gly Arg Val Gly Trp Leu Gln Arg Leu Gln Glu Ser Leu Gln Gln 20 25 30 Arg Ala Leu Arg Thr Arg Leu Arg Leu Gln Thr Met Thr Leu Glu His 35 40 45 Val Leu Arg Phe Leu Arg Arg Asn Ala Phe Ile Leu Leu Thr Val Ser 50 55 60 Ala Val Val Ile Gly Val Ser Leu Ala Phe Ala Leu Arg Pro Tyr Gln 65 70 75 80 Leu Thr Tyr Arg Gln Ile Lys Tyr Phe Ser Phe Pro Gly Glu Leu Leu 85 90 95 Met Arg Met Leu Gln Met Leu Val Leu Pro Leu Ile Val Ser Ser Leu 100 105 110 Val Thr Gly Met Ala Ser Leu Asp Asn Lys Ala Thr Gly Arg Met Gly 115 120 125 Met Arg Ala Ala Val Tyr Tyr Met Val Thr Thr Ile Ile Ala Val Phe 130 135 140 Ile Gly Ile Leu Met Val Thr Ile Ile His Pro Gly Lys Gly Ser Lys 145 150 155 160 Glu Gly Leu His Arg Glu Gly Arg Ile Glu Thr Ile Pro Thr Ala Asp 165 170 175 Ala Phe Met Asp Leu Ile Arg Asn Met Phe Pro Pro Asn Leu Val Glu 180 185 190 Ala Cys Phe Lys Gln Phe Lys Thr Gln Tyr Ser Thr Arg Val Val Thr 195 200 205 Arg Thr Met Val Arg Thr Glu Asn Gly Ser Glu Pro Gly Ala Ser Met 210 215 220 Pro Pro Pro Phe Ser Val Glu Asn Gly Thr Ser Phe Leu Glu Asn Val 225 230 235 240 Thr Arg Ala Leu Gly Thr Leu Gln Glu Met Leu Ser Phe Glu Glu Thr 245 250 255 Val Pro Val Pro Gly Ser Ala Asn Gly Ile Asn Ala Leu Gly Leu Val 260 265 270 Val Phe Ser Val Ala Phe Gly Leu Val Ile Gly Gly Met Lys His Lys 275 280 285 Gly Arg Val Leu Arg Asp Phe Phe Asp Ser Leu Asn Glu Ala Ile Met 290 295 300 Arg Leu Val Gly Ile Ile Ile Trp Tyr Ala Pro Val Gly Ile Leu Phe 305 310 315 320 Leu Ile Ala Gly Lys Ile Leu Glu Met Glu Asp Met Ala Val Leu Gly 325 330 335 Gly Gln Leu Gly Met Tyr Thr Leu Thr Val Ile Val Gly Leu Phe Leu 340 345 350 His Ala Gly Ile Val Leu Pro Leu Ile Tyr Phe Leu Val Thr His Arg 355 360 365 Asn Pro Phe Pro Phe Ile Gly Gly Met Leu Gln Ala Leu Ile Thr Ala 370 375 380 Met Gly Thr Ser Ser Ser Ser Ala Thr Leu Pro Ile Thr Phe Arg Cys 385 390 395 400 Leu Glu Glu Gly Leu Gly Val Asp Arg Arg Ile Thr Arg Phe Val Leu 405 410 415 Pro Val Gly Ala Thr Val Asn Met Asp Gly Thr Ala Leu Tyr Glu Ala 420 425 430 Leu Ala Ala Ile Phe Ile Ala Gln Val Asn Asn Tyr Glu Leu Asn Leu 435 440 445 Gly Gln Ile Thr Thr Ile Ser Ile Thr Ala Thr Ala Ala Ser Val Gly 450 455 460 Ala Ala Gly Ile Pro Gln Ala Gly Leu Val Thr Met Val Ile Val Leu 465 470 475 480 Thr Ser Val Gly Leu Pro Thr Glu Asp Ile Thr Leu Ile Ile Ala Val 485 490 495 Asp Trp Phe Leu Asp Arg Leu Arg Thr Met Thr Asn Val Leu Gly Asp 500 505 510 Ser Ile Gly Ala Ala Val Ile Glu His Leu Ser Gln Arg Glu Leu Glu 515 520 525 Leu Gln Glu Ala Glu Leu Thr Leu Pro Ser Leu Gly Lys Pro Tyr Lys 530 535 540 Ser Leu Met Ala Gln Glu Lys Gly Ala Ser Arg Gly Arg Gly Gly Asn 545 550 555 560 Glu Ser Ala Met 11 2654 DNA Homo sapiens 11 ctggtgttta gcaactccga ccacctgcct gctgaggggc tagagccctc agcccagacc 60 ctgtgccccc ggccgggctc tcatgcgtgg aatggtgctg tgccccttgc cagcaggcca 120 ggctcaccat ggtgccgcat gccatcttgg cacgggggag ggacgtgtgc aggcggaatg 180 gactcctcat cctgtctgtg ctgtctgtca tcgtgggctg cctcctcggc ttcttcttga 240 ggacccggcg cctctcacca caggaaatta gttacttcca gttccctgga gagctcctga 300 tgaggatgct gaagatgatg atcctgccac tggtggtctc cagcttgatg tccggacttg 360 cctccctgga tgccaagacc tctagccgcc tgggcgtcct caccgtggcg tactacctgt 420 ggaccacctt catggctgtc atcgtgggca tcttcatggt ctccatcatc cacccaggca 480 gcgcggccca gaaggagacc acggagcaga gtgggaagcc catcatgagc tcagccgatg 540 ccctgttgga cctcatccgg aacatgttcc cagccaacct agtagaagcc acattcaaac 600 agtaccgcac caagaccacc ccagttgtca agtcccccaa ggtggcacca gaggaggccc 660 ctcctcggcg gatcctcatc tacggggtcc aggaggagaa tggctcccat gtgcagaact 720 tcgccctgga cctgaccccg ccgcccgagg tcgtttacaa gtcagagccg ggcaccagcg 780 atggcatgaa tgtgctgggc atcgtcttct tctctgccac catgggcatc atgctgggcc 840 gcatgggtga cagcggggcc cccctggtca gcttctgcca gtgcctcaat gagtcggtca 900 tgaagatcgt ggcggtggct gtgtggtatt tccccttcgg cattgtgttc ctcattgcgg 960 gtaagatcct ggagatggac gaccccaggg ccgtcggcaa gaagctgggc ttctactcag 1020 tcaccgtggt gtgcgggctg gtgctccacg ggctctttat cctgcccctg ctctacttct 1080 tcatcaccaa gaagaatccc atcgtcttca tccgcggcat cctgcaggct ctgctcatcg 1140 cgctggccac ctcctccagc tcagccacac tgcccatcac cttcaagtgc ctgctggaga 1200 acaaccacat cgaccggcgc atcgctcgct tcgtgctgcc cgtgggtgcc accatcaaca 1260 tggacggcac tgcgctctac gaggctgtgg ccgccatctt catcgcccag gtcaacaact 1320 acgagctgga ctttggccag atcatcacca tcagtatcac agccactgca gccagcattg 1380 gggcagctgg catcccccag gccggcctcg tcaccatggt catcgtgctc acctccgtgg 1440 gactgcccac cgatgacatc accctcatca ttgccgttga ctgggctctg gaccgtttcc 1500 gcaccatgat taacgtgctg ggtgatgcgc tggcagcggg gatcatggcc catatatgtc 1560 ggaaggattt tgcccgggac acaggcaccg agaaactgct gccctgcgag accaagccag 1620 tgagcctcca ggagatcgtg gcagcccagc agaatggctg tgtgaagagt gtagccgagg 1680 cctccgagct caccctgggc cccacctgcc cccaccacgt ccccgttcaa gtggagcggg 1740 atgaggagct gcccgctgcg agtctgaacc actgcaccat ccagatcagc gagctggaga 1800 ccaatgtctg agcctgcgga gctgcagggg caggcgaggc ctccaggggc agggtcctga 1860 ggcaggaact cgactctcca accctcctga gcagccggca ggggccagga tcacacattc 1920 ttctcaccct tgagaggctg gaattaaccc cgcttgacgg aaaatgtatc tcagagaagg 1980 gaaaggctgc atgggggagc cccatccagg gagtgatggg cccggcattg cctgaggccc 2040 cgctgtgaca gtttccccgg tgtgagcccg gtgagggcgg caggcagggg ttatccggcc 2100 ccactttctg gatgacagac ttgaggctct gagagctgaa aacacttgtc caaggtctca 2160 cgttaaggtc aagacactaa ctcaaatctt tcaagccccg cctctcctct tggaggacag 2220 ggcagcctgc agctgtgtcc aggcccaggc cccaccccat aacaggtggc ctcagccaca 2280 cagttctccc caaggggagc agcccagggc caagccccgc tgccttcccc aggccacagt 2340 gcgtccagtc tcctgtcctg ccacgtgtct tttgcaaagc tccttggatg tggagacaga 2400 tgtctttact agagctgaaa ggcccccttg acacatccag gccaacctcc catggaatag 2460 gtaggcaagc caggactccg ggaaggaggt gcagccagga tgctctggtg gagctgccga 2520 tggggccctg gtgtcagaac tccccaaagg cctgtgcgtc caagtggagt caggttttct 2580 attcctttct gtgtttgcaa attcagtgtt aactaaataa aggtattttg tttttcaaaa 2640 aaaaaaaaaa aaaa 2654 12 560 PRT Homo sapiens 12 Met Val Pro His Ala Ile Leu Ala Arg Gly Arg Asp Val Cys Arg Arg 1 5 10 15 Asn Gly Leu Leu Ile Leu Ser Val Leu Ser Val Ile Val Gly Cys Leu 20 25 30 Leu Gly Phe Phe Leu Arg Thr Arg Arg Leu Ser Pro Gln Glu Ile Ser 35 40 45 Tyr Phe Gln Phe Pro Gly Glu Leu Leu Met Arg Met Leu Lys Met Met 50 55 60 Ile Leu Pro Leu Val Val Ser Ser Leu Met Ser Gly Leu Ala Ser Leu 65 70 75 80 Asp Ala Lys Thr Ser Ser Arg Leu Gly Val Leu Thr Val Ala Tyr Tyr 85 90 95 Leu Trp Thr Thr Phe Met Ala Val Ile Val Gly Ile Phe Met Val Ser 100 105 110 Ile Ile His Pro Gly Ser Ala Ala Gln Lys Glu Thr Thr Glu Gln Ser 115 120 125 Gly Lys Pro Ile Met Ser Ser Ala Asp Ala Leu Leu Asp Leu Ile Arg 130 135 140 Asn Met Phe Pro Ala Asn Leu Val Glu Ala Thr Phe Lys Gln Tyr Arg 145 150 155 160 Thr Lys Thr Thr Pro Val Val Lys Ser Pro Lys Val Ala Pro Glu Glu 165 170 175 Ala Pro Pro Arg Arg Ile Leu Ile Tyr Gly Val Gln Glu Glu Asn Gly 180 185 190 Ser His Val Gln Asn Phe Ala Leu Asp Leu Thr Pro Pro Pro Glu Val 195 200 205 Val Tyr Lys Ser Glu Pro Gly Thr Ser Asp Gly Met Asn Val Leu Gly 210 215 220 Ile Val Phe Phe Ser Ala Thr Met Gly Ile Met Leu Gly Arg Met Gly 225 230 235 240 Asp Ser Gly Ala Pro Leu Val Ser Phe Cys Gln Cys Leu Asn Glu Ser 245 250 255 Val Met Lys Ile Val Ala Val Ala Val Trp Tyr Phe Pro Phe Gly Ile 260 265 270 Val Phe Leu Ile Ala Gly Lys Ile Leu Glu Met Asp Asp Pro Arg Ala 275 280 285 Val Gly Lys Lys Leu Gly Phe Tyr Ser Val Thr Val Val Cys Gly Leu 290 295 300 Val Leu His Gly Leu Phe Ile Leu Pro Leu Leu Tyr Phe Phe Ile Thr 305 310 315 320 Lys Lys Asn Pro Ile Val Phe Ile Arg Gly Ile Leu Gln Ala Leu Leu 325 330 335 Ile Ala Leu Ala Thr Ser Ser Ser Ser Ala Thr Leu Pro Ile Thr Phe 340 345 350 Lys Cys Leu Leu Glu Asn Asn His Ile Asp Arg Arg Ile Ala Arg Phe 355 360 365 Val Leu Pro Val Gly Ala Thr Ile Asn Met Asp Gly Thr Ala Leu Tyr 370 375 380 Glu Ala Val Ala Ala Ile Phe Ile Ala Gln Val Asn Asn Tyr Glu Leu 385 390 395 400 Asp Phe Gly Gln Ile Ile Thr Ile Ser Ile Thr Ala Thr Ala Ala Ser 405 410 415 Ile Gly Ala Ala Gly Ile Pro Gln Ala Gly Leu Val Thr Met Val Ile 420 425 430 Val Leu Thr Ser Val Gly Leu Pro Thr Asp Asp Ile Thr Leu Ile Ile 435 440 445 Ala Val Asp Trp Ala Leu Asp Arg Phe Arg Thr Met Ile Asn Val Leu 450 455 460 Gly Asp Ala Leu Ala Ala Gly Ile Met Ala His Ile Cys Arg Lys Asp 465 470 475 480 Phe Ala Arg Asp Thr Gly Thr Glu Lys Leu Leu Pro Cys Glu Thr Lys 485 490 495 Pro Val Ser Leu Gln Glu Ile Val Ala Ala Gln Gln Asn Gly Cys Val 500 505 510 Lys Ser Val Ala Glu Ala Ser Glu Leu Thr Leu Gly Pro Thr Cys Pro 515 520 525 His His Val Pro Val Gln Val Glu Arg Asp Glu Glu Leu Pro Ala Ala 530 535 540 Ser Leu Asn His Cys Thr Ile Gln Ile Ser Glu Leu Glu Thr Asn Val 545 550 555 560 13 6619 DNA Homo sapiens 13 gaattccctt acaaacgcct ccagcttgta gaggcggtcg tggaggaccc agaggaggag 60 acgaagggga aggaggcggt ggtggaggag gcaaaggcct tggacgacca ttgttggcga 120 ggggcaccac tccgggagag gcggcgctgg gcgtcttggg ggtgcgcgcc gggagcctgc 180 agcgggacca gcgtgggaac gcggctggca ggctgtggac ctcgtcctca ccaccatggt 240 cgggctcctt ttgttttttt tcccagcgat ctttttggag gtgtcccttc tccccagaag 300 ccccggcagg aaagtgttgc tggcaggagc gtcgtctcag cgctcggtgg ccagaatgga 360 cggagatgtc atcattggag ccctcttctc agtccatcac cagcctccgg ccgagaaagt 420 gcccgagagg aagtgtgggg agatcaggga gcagtatggc atccagaggg tggaggccat 480 gttccacacg ttggataaga tcaacgcgga cccggtcctc ctgcccaaca tcaccctggg 540 cagtgagatc cgggactcct gctggcactc ttccgtggct ctggaacaga gcattgagtt 600 cattagggac tctctgattt ccattcgaga tgagaaggat gggatcaacc ggtgtctgcc 660 tgacggccag tccctccccc caggcaggac taagaagccc attgcgggag tgatcggtcc 720 cggctccagc tctgtagcca ttcaagtgca gaacctgctc cagctcttcg acatccccca 780 gatcgcttat tcagccacaa gcatcgacct gagtgacaaa actttgtaca aatacttcct 840 gagggttgtc ccttctgaca ctttgcaggc aagggccatg cttgacatag tcaaacgtta 900 caattggacc tatgtctctg cagtccacac ggaagggaat tatggggaga gcggaatgga 960 cgctttcaaa gagctggctg cccaggaagg cctctgtatc gcccattctg acaaaatcta 1020 cagcaacgct ggggagaaga gctttgaccg actcttgcgc aaactccgag agaggcttcc 1080 caaggctaga gtggtggtct gcttctgtga aggcatgaca gtgcgaggac tcctgagcgc 1140 catgcggcgc cttggcgtcg tgggcgagtt ctcactcatt ggaagtgatg gatgggcaga 1200 cagagatgaa gtcattgaag gttatgaggt ggaagccaac gggggaatca cgataaagct 1260 gcagtctcca gaggtcaggt catttgatga ttatttcctg aaactgaggc tggacactaa 1320 cacgaggaat ccctggttcc ctgagttctg gcaacatcgg ttccagtgcc gccttccagg 1380 acaccttctg gaaaatccca actttaaacg aatctgcaca ggcaatgaaa gcttagaaga 1440 aaactatgtc caggacagta agatggggtt tgtcatcaat gccatctatg ccatggcaca 1500 tgggctgcag aacatgcacc atgccctctg ccctggccac gtgggcctct gcgatgccat 1560 gaagcccatc gacggcagca agctgctgga cttcctcatc aagtcctcat tcattggagt 1620 atctggagag gaggtgtggt ttgatgagaa aggagacgct cctggaaggt atgatatcat 1680 gaatctgcag tacactgaag ctaatcgcta tgactatgtg cacgttggaa cctggcatga 1740 aggagtgctg aacattgatg attacaaaat ccagatgaac aagagtggag tggtgcggtc 1800 tgtgtgcagt gagccttgct taaagggcca gattaaggtt atacggaaag gagaagtgag 1860 ctgctgctgg atttgcacgg cctgcaaaga gaatgaatat gtgcaagatg agttcacctg 1920 caaagcttgt gacttgggat ggtggcccaa tgcagatcta acaggctgtg agcccattcc 1980 tgtgcgctat cttgagtgga gcaacatcga atccattata gccatcgcct tttcatgcct 2040 gggaatcctt gttaccttgt ttgtcaccct aatctttgta ctgtaccggg acacaccagt 2100 ggtcaaatcc tccagtcggg agctctgcta catcatccta gctggcatct tccttggtta 2160 tgtgtgccca ttcactctca ttgccaaacc tactaccacc tcctgctacc tccagcgcct 2220 cttggttggc ctctcctctg cgatgtgcta ctctgcttta gtgactaaaa ccaatcgtat 2280 tgcacgcatc ctggctggca gcaagaagaa gatctgcacc cggaagccca ggttcatgag 2340 tgcctgggct caggtgatca ttgcctcaat tctgattagt gtgcaactaa ccctggtggt 2400 aaccctgatc atcatggaac cccctatgcc cattctgtcc tacccaagta tcaaggaagt 2460 ctaccttatc tgcaatacca gcaacctggg tgtggtggcc cctttgggct acaatggact 2520 cctcatcatg agctgtacct actatgcctt caagacccgc aacgtgcccg ccaacttcaa 2580 cgaggccaaa tatatcgcgt tcaccatgta caccacctgt atcatctggc tagcttttgt 2640 gcccatttac tttgggagca actacaagat catcacaact tgctttgcag tgagtctcag 2700 tgtaacagtg gctctggggt gcatgttcac tcccaagatg tacatcatta ttgccaagcc 2760 tgagaggaat gtccgcagtg ccttcaccac ctctgatgtt gtccgcatgc atgttggcga 2820 tggcaagctg ccctgccgct ccaacacttt cctcaacatc ttccgaagaa agaaggcagg 2880 ggcagggaat gccaattcta atggcaagtc tgtgtcatgg tctgaaccag gtggaggaca 2940 ggtgcccaag ggacagcata tgtggcaccg cctctctgtg cacgtgaaga ccaatgagac 3000 ggcctgcaac caaacagccg tcatcaaacc cctcactaaa agttaccaag gctctggcaa 3060 gagcctgacc ttttcagata ccagcaccaa gaccctttac aacgtagagg aggaggagga 3120 tgcccagccg attcgcttta gcccgcctgg tagcccttcc atggtggtgc acaggcgcgt 3180 gccaagcgcg gcgaccactc cgcctctgcc gccccacctg accgcagagg agacccccct 3240 cttcctggcc gaaccagccc tccccaaggg cttgccccct cctctccagc agcagcagca 3300 accccctcca cagcagaaat cgctgatgga ccagctccag ggagtggtca gcaacttcag 3360 taccgcgatc ccggattttc acgcggtgct ggcaggcccc gggggtcccg ggaacgggct 3420 gcggtccctg tacccgcccc cgccaccgcc gcagcacctg cagatgctgc cgctgcagct 3480 gagcaccttt ggggaggagc tggtctcccc gcccgcggac gacgacgacg acagcgagag 3540 gtttaagctc ctccaggagt acgtgtatga gcacgagcgg gaagggaaca cggaagaaga 3600 cgaactggaa gaggaggagg aggacctgca ggcggccagc aaactgaccc cggatgattc 3660 gcctgcgctg acgcctccgt cgcctttccg cgactcggtg gcctcgggca gctcggtgcc 3720 cagctcccca gtgtccgagt cggtgctctg cacccctccc aacgtatcct acgcctctgt 3780 cattctgcgg gactacaagc aaagctcttc caccctgtaa gggggaaggg tccacataga 3840 aaagcaagac aagccagaga tctcccacac ctccagagat gtgcaaacag ctgggaggaa 3900 aagcctggga gtggggggcc tcgtcgggag gacaggagac cgctgctgct gctgccgcta 3960 ctgctgctgc tgccttaagt aggaagagag ggaaggacac caagcaaaaa atgttcaggc 4020 caggattcgg attcttgaat tactcgaagc cttctctggg aagaaaggga attctgacaa 4080 agcacaattc catatggtat gtaactttta tcacaaatca aatagtgaca tcacaaacat 4140 aatgtcctct tttgcacaat tgtgcataga tatatatatg cccacacaca ctgggccatg 4200 cttggcaagg aacagaccac gtggcatcca gtcggatcat gagttcacct gatgcattcg 4260 gagtgagctg gtggagccag acagagcagg tgcggggaag ggaagggcca ggccagaccc 4320 atcccaaacg gatgatggga tgatgggaca gcagttcctt gctcagaagc ccttctcccc 4380 gctgggctga cagactcctc atcttcagga gactcaggaa tggagcggta caggggtctc 4440 tcttcatcca ccgcaaccca tccagtgcca gctttgagat tgcacttgaa gaaaggtgca 4500 tggaccccct gctgctctgc agattccctt tatttaggaa aacaggaata agagcaaaat 4560 tatcaccaaa aagtgcttca tcaggcgtgc tacaggagga aggagctaga aatagaacaa 4620 tccatcagca tgagactttg aaaaaaaaaa cacatgatca gcttctcatg ttccatattc 4680 acttattggc gatttgggga aaaggccgga acaagagatt gttacgagag tggcagaaac 4740 ccttttgtag attgacttgt gtttgtgcca agcgggcttt ccattgacct tcagttaaag 4800 aacaaaccat gtgacaaaat tgttaccttc cacttactgt agcaaataat acctacaagt 4860 tgaacttcta agatgcgtat atgtacaatt tggtgccatt atttctccta cgtattagag 4920 aaacaaatcc atctttgaat ctaatggtgt actcatagca actattactg gtttaaatga 4980 caaataattc tatcctattg tcactgaagt ccttgtaact agcgagtgaa tgtgttcctg 5040 tgtccttgta tatgtgcgat cgtaaaattt gtgcaatgta atgtcaaatt gacctgtcaa 5100 tgtcaaccta gtagtcaatc taactgcaat tagaaattgt cttttgaata tactatatat 5160 attttttatg ttccaataat gttttataca tcattgtcat caatatctac agaagctctt 5220 tgacggtttg aatactatgg ctcaaggttt tcatatgcag ctcggatgga catttttctt 5280 ctaagatgga acttattttt cagatatttt ctgatgtgga gatatgttat taatgaagtg 5340 gtttgaaaat ttgttatatt aaaagtgcac aaaaactgag agtgaaaata aaaggtacat 5400 tttataagct tgcacacatt attaacacat aagattgaac aaagcattta gattattcca 5460 ggttatatca tttttttaaa gattttccac agctacttga gtgtctaaca tacagtaaca 5520 tctaactcag ctaataattt gtaaaatctt tatcaatcac attgtggcct cttttaattt 5580 ttatgttcat ggacttttat tcctgtgtct tggctgtcat aactttttat ttctgctatt 5640 tgctgttgtg taatatccat ggacatgtaa tccacttact ccatctttac aatccctttt 5700 taccaccaat aaaaggattt tttcttgctg ttttgatttc ttctattatt tgtggaatga 5760 attatacccc ccttaaatat ctttgtttat gccttatgtt cagtcatatt ttaatatgct 5820 tccttcatat tgaagctgct gatttctcag ccaaaaatca tcttagaatc tttaaatatc 5880 cattgcatca tttgttcaga atttaacatc cattccaatg ttggaggctt gtattactta 5940 tatttcatca tattctattg ccaagtttag tcagttccac accaagaatg aactgcattt 6000 cctttaaaaa ttattttaaa acacctttat tgaaaagatc tcatgactga gatgtggact 6060 ttggttccat gttttcattg taagaaagca gagagcggaa aatcaatggc tccagtgatt 6120 aatagatggg tttttagtaa ttgacaaatt catgagggaa agcatatgat ctctttatta 6180 gtgaatcatg cttatttttt actcttaacg ccactaatat acatccctaa tatcacaggg 6240 cttgtgcatt cagattttta aaaaattagg atagataagg aaacaactta tattcaagtg 6300 taagatgata tcaggttggt ctaagacttt tggtgaacac gttcattcaa ctgtgatcac 6360 tttattactc tgaatgccta ctattatcct gattatgggg tctcctgaat aaatagagta 6420 ttagtcttta tgtcatcatt gttcaaaatt ggagatgtac acatacatac cctataccaa 6480 gagggccgaa actcttcacc ttgatgtatg ttctgataca agttgttcag cttcttgtaa 6540 atgtgttttc cttcggcttg ttactgcctt ttgtcaaata atcttgacaa tgctgtataa 6600 taaatatttt ctatttatt 6619 14 1194 PRT Homo sapiens 14 Met Val Gly Leu Leu Leu Phe Phe Phe Pro Ala Ile Phe Leu Glu Val 1 5 10 15 Ser Leu Leu Pro Arg Ser Pro Gly Arg Lys Val Leu Leu Ala Gly Ala 20 25 30 Ser Ser Gln Arg Ser Val Ala Arg Met Asp Gly Asp Val Ile Ile Gly 35 40 45 Ala Leu Phe Ser Val His His Gln Pro Pro Ala Glu Lys Val Pro Glu 50 55 60 Arg Lys Cys Gly Glu Ile Arg Glu Gln Tyr Gly Ile Gln Arg Val Glu 65 70 75 80 Ala Met Phe His Thr Leu Asp Lys Ile Asn Ala Asp Pro Val Leu Leu 85 90 95 Pro Asn Ile Thr Leu Gly Ser Glu Ile Arg Asp Ser Cys Trp His Ser 100 105 110 Ser Val Ala Leu Glu Gln Ser Ile Glu Phe Ile Arg Asp Ser Leu Ile 115 120 125 Ser Ile Arg Asp Glu Lys Asp Gly Ile Asn Arg Cys Leu Pro Asp Gly 130 135 140 Gln Ser Leu Pro Pro Gly Arg Thr Lys Lys Pro Ile Ala Gly Val Ile 145 150 155 160 Gly Pro Gly Ser Ser Ser Val Ala Ile Gln Val Gln Asn Leu Leu Gln 165 170 175 Leu Phe Asp Ile Pro Gln Ile Ala Tyr Ser Ala Thr Ser Ile Asp Leu 180 185 190 Ser Asp Lys Thr Leu Tyr Lys Tyr Phe Leu Arg Val Val Pro Ser Asp 195 200 205 Thr Leu Gln Ala Arg Ala Met Leu Asp Ile Val Lys Arg Tyr Asn Trp 210 215 220 Thr Tyr Val Ser Ala Val His Thr Glu Gly Asn Tyr Gly Glu Ser Gly 225 230 235 240 Met Asp Ala Phe Lys Glu Leu Ala Ala Gln Glu Gly Leu Cys Ile Ala 245 250 255 His Ser Asp Lys Ile Tyr Ser Asn Ala Gly Glu Lys Ser Phe Asp Arg 260 265 270 Leu Leu Arg Lys Leu Arg Glu Arg Leu Pro Lys Ala Arg Val Val Val 275 280 285 Cys Phe Cys Glu Gly Met Thr Val Arg Gly Leu Leu Ser Ala Met Arg 290 295 300 Arg Leu Gly Val Val Gly Glu Phe Ser Leu Ile Gly Ser Asp Gly Trp 305 310 315 320 Ala Asp Arg Asp Glu Val Ile Glu Gly Tyr Glu Val Glu Ala Asn Gly 325 330 335 Gly Ile Thr Ile Lys Leu Gln Ser Pro Glu Val Arg Ser Phe Asp Asp 340 345 350 Tyr Phe Leu Lys Leu Arg Leu Asp Thr Asn Thr Arg Asn Pro Trp Phe 355 360 365 Pro Glu Phe Trp Gln His Arg Phe Gln Cys Arg Leu Pro Gly His Leu 370 375 380 Leu Glu Asn Pro Asn Phe Lys Arg Ile Cys Thr Gly Asn Glu Ser Leu 385 390 395 400 Glu Glu Asn Tyr Val Gln Asp Ser Lys Met Gly Phe Val Ile Asn Ala 405 410 415 Ile Tyr Ala Met Ala His Gly Leu Gln Asn Met His His Ala Leu Cys 420 425 430 Pro Gly His Val Gly Leu Cys Asp Ala Met Lys Pro Ile Asp Gly Ser 435 440 445 Lys Leu Leu Asp Phe Leu Ile Lys Ser Ser Phe Ile Gly Val Ser Gly 450 455 460 Glu Glu Val Trp Phe Asp Glu Lys Gly Asp Ala Pro Gly Arg Tyr Asp 465 470 475 480 Ile Met Asn Leu Gln Tyr Thr Glu Ala Asn Arg Tyr Asp Tyr Val His 485 490 495 Val Gly Thr Trp His Glu Gly Val Leu Asn Ile Asp Asp Tyr Lys Ile 500 505 510 Gln Met Asn Lys Ser Gly Val Val Arg Ser Val Cys Ser Glu Pro Cys 515 520 525 Leu Lys Gly Gln Ile Lys Val Ile Arg Lys Gly Glu Val Ser Cys Cys 530 535 540 Trp Ile Cys Thr Ala Cys Lys Glu Asn Glu Tyr Val Gln Asp Glu Phe 545 550 555 560 Thr Cys Lys Ala Cys Asp Leu Gly Trp Trp Pro Asn Ala Asp Leu Thr 565 570 575 Gly Cys Glu Pro Ile Pro Val Arg Tyr Leu Glu Trp Ser Asn Ile Glu 580 585 590 Ser Ile Ile Ala Ile Ala Phe Ser Cys Leu Gly Ile Leu Val Thr Leu 595 600 605 Phe Val Thr Leu Ile Phe Val Leu Tyr Arg Asp Thr Pro Val Val Lys 610 615 620 Ser Ser Ser Arg Glu Leu Cys Tyr Ile Ile Leu Ala Gly Ile Phe Leu 625 630 635 640 Gly Tyr Val Cys Pro Phe Thr Leu Ile Ala Lys Pro Thr Thr Thr Ser 645 650 655 Cys Tyr Leu Gln Arg Leu Leu Val Gly Leu Ser Ser Ala Met Cys Tyr 660 665 670 Ser Ala Leu Val Thr Lys Thr Asn Arg Ile Ala Arg Ile Leu Ala Gly 675 680 685 Ser Lys Lys Lys Ile Cys Thr Arg Lys Pro Arg Phe Met Ser Ala Trp 690 695 700 Ala Gln Val Ile Ile Ala Ser Ile Leu Ile Ser Val Gln Leu Thr Leu 705 710 715 720 Val Val Thr Leu Ile Ile Met Glu Pro Pro Met Pro Ile Leu Ser Tyr 725 730 735 Pro Ser Ile Lys Glu Val Tyr Leu Ile Cys Asn Thr Ser Asn Leu Gly 740 745 750 Val Val Ala Pro Leu Gly Tyr Asn Gly Leu Leu Ile Met Ser Cys Thr 755 760 765 Tyr Tyr Ala Phe Lys Thr Arg Asn Val Pro Ala Asn Phe Asn Glu Ala 770 775 780 Lys Tyr Ile Ala Phe Thr Met Tyr Thr Thr Cys Ile Ile Trp Leu Ala 785 790 795 800 Phe Val Pro Ile Tyr Phe Gly Ser Asn Tyr Lys Ile Ile Thr Thr Cys 805 810 815 Phe Ala Val Ser Leu Ser Val Thr Val Ala Leu Gly Cys Met Phe Thr 820 825 830 Pro Lys Met Tyr Ile Ile Ile Ala Lys Pro Glu Arg Asn Val Arg Ser 835 840 845 Ala Phe Thr Thr Ser Asp Val Val Arg Met His Val Gly Asp Gly Lys 850 855 860 Leu Pro Cys Arg Ser Asn Thr Phe Leu Asn Ile Phe Arg Arg Lys Lys 865 870 875 880 Ala Gly Ala Gly Asn Ala Asn Ser Asn Gly Lys Ser Val Ser Trp Ser 885 890 895 Glu Pro Gly Gly Gly Gln Val Pro Lys Gly Gln His Met Trp His Arg 900 905 910 Leu Ser Val His Val Lys Thr Asn Glu Thr Ala Cys Asn Gln Thr Ala 915 920 925 Val Ile Lys Pro Leu Thr Lys Ser Tyr Gln Gly Ser Gly Lys Ser Leu 930 935 940 Thr Phe Ser Asp Thr Ser Thr Lys Thr Leu Tyr Asn Val Glu Glu Glu 945 950 955 960 Glu Asp Ala Gln Pro Ile Arg Phe Ser Pro Pro Gly Ser Pro Ser Met 965 970 975 Val Val His Arg Arg Val Pro Ser Ala Ala Thr Thr Pro Pro Leu Pro 980 985 990 Pro His Leu Thr Ala Glu Glu Thr Pro Leu Phe Leu Ala Glu Pro Ala 995 1000 1005 Leu Pro Lys Gly Leu Pro Pro Pro Leu Gln Gln Gln Gln Gln Pro Pro 1010 1015 1020 Pro Gln Gln Lys Ser Leu Met Asp Gln Leu Gln Gly Val Val Ser Asn 1025 1030 1035 1040 Phe Ser Thr Ala Ile Pro Asp Phe His Ala Val Leu Ala Gly Pro Gly 1045 1050 1055 Gly Pro Gly Asn Gly Leu Arg Ser Leu Tyr Pro Pro Pro Pro Pro Pro 1060 1065 1070 Gln His Leu Gln Met Leu Pro Leu Gln Leu Ser Thr Phe Gly Glu Glu 1075 1080 1085 Leu Val Ser Pro Pro Ala Asp Asp Asp Asp Asp Ser Glu Arg Phe Lys 1090 1095 1100 Leu Leu Gln Glu Tyr Val Tyr Glu His Glu Arg Glu Gly Asn Thr Glu 1105 1110 1115 1120 Glu Asp Glu Leu Glu Glu Glu Glu Glu Asp Leu Gln Ala Ala Ser Lys 1125 1130 1135 Leu Thr Pro Asp Asp Ser Pro Ala Leu Thr Pro Pro Ser Pro Phe Arg 1140 1145 1150 Asp Ser Val Ala Ser Gly Ser Ser Val Pro Ser Ser Pro Val Ser Glu 1155 1160 1165 Ser Val Leu Cys Thr Pro Pro Asn Val Ser Tyr Ala Ser Val Ile Leu 1170 1175 1180 Arg Asp Tyr Lys Gln Ser Ser Ser Thr Leu 1185 1190 15 4518 DNA Homo sapiens 15 acaaaatggt cctttagaaa atacatctga attgctggct aatttcttga tttgcgactc 60 aacgtaggac atcgcttgtt cgtagctatc agaaccctcc tgaattttcc ccaccatgct 120 atctttattg gcttgaactc ctttcctaaa atggtccttc tgttgatcct gtcagtctta 180 cttttgaaag aagatgtccg tgggagtgca cagtccagtg agaggagggt ggtggctcac 240 atgccgggtg acatcattat tggagctctc ttttctgttc atcaccagcc tactgtggac 300 aaagttcatg agaggaagtg tggggcggtc cgtgaacagt atggcattca gagagtggag 360 gccatgctgc ataccctgga aaggatcaat tcagacccca cactcttgcc caacatcaca 420 ctgggctgtg agataaggga ctcctgctgg cattcggctg tggccctaga gcagagcatt 480 gagttcataa gagattccct catttcttca gaagaggaag aaggcttggt acgctgtgtg 540 gatggctcct cctcttcctt ccgctccaag aagcccatag taggggtcat tgggcctggc 600 tccagttctg tagccattca ggtccagaat ttgctccagc ttttcaacat acctcagatt 660 gcttactcag caaccagcat ggatctgagt gacaagactc tgttcaaata tttcatgagg 720 gttgtgcctt cagatgctca gcaggcaagg gccatggtgg acatagtgaa gaggtacaac 780 tggacctatg tatcagccgt gcacacagaa ggcaactatg gagaaagtgg gatggaagcc 840 ttcaaagata tgtcagcgaa ggaagggatt tgcatcgccc actcttacaa aatctacagt 900 aatgcagggg agcagagctt tgataagctg ctgaagaagc tcacaagtca cttgcccaag 960 gcccgggtgg tggcctgctt ctgtgagggc atgacggtga gaggtctgct gatggccatg 1020 aggcgcctgg gtctagcggg agaatttctg cttctgggca gtgatggctg ggctgacagg 1080 tatgatgtga cagatggata tcagcgagaa gctgttggtg gcatcacaat caagctccaa 1140 tctcccgatg tcaagtggtt tgatgattat tatctgaagc tccggccaga aacaaaccac 1200 cgaaaccctt ggtttcaaga attttggcag catcgttttc agtgccgact ggaagggttt 1260 ccacaggaga acagcaaata caacaagact tgcaatagtt ctctgactct gaaaacacat 1320 catgttcagg attccaaaat gggatttgtg atcaacgcca tctattcgat ggcctatggg 1380 ctccacaaca tgcagatgtc cctctgccca ggctatgcag gactctgtga tgccatgaag 1440 ccaattgatg gacggaaact tttggagtcc ctgatgaaaa ccaattttac tggggtttct 1500 ggagatacga tcctattcga tgagaatgga gactctccag gaaggtatga aataatgaat 1560 ttcaaggaaa tgggaaaaga ttactttgat tatatcaacg ttggaagttg ggacaatgga 1620 gaattaaaaa tggatgatga tgaagtatgg tccaagaaaa gcaacatcat cagatctgtg 1680 tgcagtgaac catgtgagaa aggccagatc aaggtgatcc gaaagggaga agtcagctgt 1740 tgttggacct gtacaccttg taaggagaat gagtatgtct ttgatgagta cacatgcaag 1800 gcatgccaac tggggtcttg gcccactgat gatctcacag gttgtgactt gatcccagta 1860 cagtatcttc gatggggtga ccctgaaccc attgcagctg tggtgtttgc ctgccttggc 1920 ctcctggcca ccctgtttgt tactgtagtc ttcatcattt accgtgatac accagtagtc 1980 aagtcctcaa gcagggaact ctgctacatt atccttgctg gcatctgcct gggctactta 2040 tgtaccttct gcctcattgc gaagcccaaa cagatttact gctaccttca gagaattggc 2100 attggtctct ccccagccat gagctactca gcccttgtaa caaagaccaa ccgtattgca 2160 aggatcctgg ctggcagcaa gaagaagatc tgtaccaaaa agcccagatt catgagtgcc 2220 tgtgcccagc tagtgattgc tttcattctc atatgcatcc agttgggcat catcgttgcc 2280 ctctttataa tggagcctcc tgacataatg catgactacc caagcattcg agaagtctac 2340 ctgatctgta acaccaccaa cctaggagtt gtcactccac ttggatacaa tggattgttg 2400 attttgagct gcaccttcta tgcgttcaag accagaaatg ttccagctaa cttcaacgag 2460 gccaagtata tcgccttcac aatgtacacg acctgcatta tatggctagc ttttgtgcca 2520 atctactttg gcagcaacta caaaatcatc accatgtgtt tctcggtcag cctcagtgcc 2580 acagtggccc taggctgcat gtttgtgccg aaggtgtaca tcatcctggc caaaccagag 2640 agaaacgtgc gcagcgcctt caccacatct accgtggtgc gcatgcatgt aggggatggc 2700 aagtcatcct ccgcagccag cagatccagc agcctagtca acctgtggaa gagaaggggc 2760 tcctctgggg aaaccttaag ttccaatgga aaatccgtca cgtgggccca gaatgagaag 2820 agcagccggg ggcagcacct gtggcagcgc ctgtccatcc acatcaacaa gaaagaaaac 2880 cccaaccaaa cggccgtcat caagcccttc cccaagagca cggagagccg tggcctgggc 2940 gctggcgctg gcgcaggcgg gagcgctggg ggcgtggggg ccacgggcgg tgcgggctgc 3000 gcaggcgccg gcccaggcgg gcccgagtcc ccagacgccg gccccaaggc gctgtatgat 3060 gtggccgagg ctgaggagca cttcccggcg cccgcgcggc cgcgctcacc gtcgcccatc 3120 agcacgctga gccaccgcgc gggctcggcc agccgcacgg acgacgatgt gccgtcgctg 3180 cactcggagc ctgtggcgcg cagcagctcc tcgcagggct ccctcatgga gcagatcagc 3240 agtgtggtca cccgcttcac ggccaacatc agcgagctca actccatgat gctgtccacc 3300 gcggccccca gccccggcgt cggcgccccg ctctgctcgt cctacctgat ccccaaagag 3360 atccagttgc ccacgaccat gacgaccttt gccgaaatcc agcctctgcc ggccatcgaa 3420 gtcacgggcg gcgcgcagcc cgcggcaggg gcgcaggcgg ctggggacgc ggcccgggag 3480 agccccgcgg ccggtcccga ggctgcggcc gccaagccag acctggagga gctggtggct 3540 ctcaccccgc cgtccccctt cagagactcg gtggactcgg ggagcacaac ccccaactcg 3600 ccagtgtccg agtcggccct ctgtatcccg tcgtctccca aatatgacac tcttatcata 3660 agagattaca ctcagagctc ctcgtcgttg tgaatgtccc tggaaagcac gccggcctgc 3720 gcgtgcggag cggagccccc cgtgttcaca cacacacaat ggcaagcata gtcgcctggt 3780 tacggcccag ggggaatatg ccaagggacc ccttaatgga aacacagatc agtagtgcta 3840 tctcatgaca accacaagaa accgacgaca aatcttttgc gagattttct tctagtggct 3900 tagaaacatg gcttttaaga aacacggtga tatctttgag ggtgacaagg cgtctcttca 3960 aacagttcca taccaactgc tttgctctag ggaagcagtg cgtgtgaaac agcgtaacgg 4020 agggtgaaga gcatagttaa taagcaactg taaaaagttt tatttgttta ctttaattct 4080 tttcccctgt aaaaagtttt atttgtttac tttaattctt ttcccagaaa agagtctttg 4140 attcaccaaa catgaatgta cattttctaa caaactcaaa atctgggacc aaaacatcaa 4200 cttttttctt tcttttttct ttctttttgt tttttctttc ctgtaaagac cttgaaaaga 4260 ccttgaaaag cagtaacttg ggtccagtat ttacggaggc gttgtgaatg tgtcccatgc 4320 ataacacact actggatagt gagtcgtgcg ctaatgtact acgtagggct tctaccagag 4380 attttcctct ccaattgggt tgtgaaatac tcttccaaaa gcctgcatcg gggattccac 4440 ctacttattt cagattcacc tccattaacc aagaaaacca gtggaagatt tcttgactat 4500 ttcaccatgt tgccaatc 4518 16 1180 PRT Homo sapiens 16 Met Val Leu Leu Leu Ile Leu Ser Val Leu Leu Leu Lys Glu Asp Val 1 5 10 15 Arg Gly Ser Ala Gln Ser Ser Glu Arg Arg Val Val Ala His Met Pro 20 25 30 Gly Asp Ile Ile Ile Gly Ala Leu Phe Ser Val His His Gln Pro Thr 35 40 45 Val Asp Lys Val His Glu Arg Lys Cys Gly Ala Val Arg Glu Gln Tyr 50 55 60 Gly Ile Gln Arg Val Glu Ala Met Leu His Thr Leu Glu Arg Ile Asn 65 70 75 80 Ser Asp Pro Thr Leu Leu Pro Asn Ile Thr Leu Gly Cys Glu Ile Arg 85 90 95 Asp Ser Cys Trp His Ser Ala Val Ala Leu Glu Gln Ser Ile Glu Phe 100 105 110 Ile Arg Asp Ser Leu Ile Ser Ser Glu Glu Glu Glu Gly Leu Val Arg 115 120 125 Cys Val Asp Gly Ser Ser Ser Ser Phe Arg Ser Lys Lys Pro Ile Val 130 135 140 Gly Val Ile Gly Pro Gly Ser Ser Ser Val Ala Ile Gln Val Gln Asn 145 150 155 160 Leu Leu Gln Leu Phe Asn Ile Pro Gln Ile Ala Tyr Ser Ala Thr Ser 165 170 175 Met Asp Leu Ser Asp Lys Thr Leu Phe Lys Tyr Phe Met Arg Val Val 180 185 190 Pro Ser Asp Ala Gln Gln Ala Arg Ala Met Val Asp Ile Val Lys Arg 195 200 205 Tyr Asn Trp Thr Tyr Val Ser Ala Val His Thr Glu Gly Asn Tyr Gly 210 215 220 Glu Ser Gly Met Glu Ala Phe Lys Asp Met Ser Ala Lys Glu Gly Ile 225 230 235 240 Cys Ile Ala His Ser Tyr Lys Ile Tyr Ser Asn Ala Gly Glu Gln Ser 245 250 255 Phe Asp Lys Leu Leu Lys Lys Leu Thr Ser His Leu Pro Lys Ala Arg 260 265 270 Val Val Ala Cys Phe Cys Glu Gly Met Thr Val Arg Gly Leu Leu Met 275 280 285 Ala Met Arg Arg Leu Gly Leu Ala Gly Glu Phe Leu Leu Leu Gly Ser 290 295 300 Asp Gly Trp Ala Asp Arg Tyr Asp Val Thr Asp Gly Tyr Gln Arg Glu 305 310 315 320 Ala Val Gly Gly Ile Thr Ile Lys Leu Gln Ser Pro Asp Val Lys Trp 325 330 335 Phe Asp Asp Tyr Tyr Leu Lys Leu Arg Pro Glu Thr Asn His Arg Asn 340 345 350 Pro Trp Phe Gln Glu Phe Trp Gln His Arg Phe Gln Cys Arg Leu Glu 355 360 365 Gly Phe Pro Gln Glu Asn Ser Lys Tyr Asn Lys Thr Cys Asn Ser Ser 370 375 380 Leu Thr Leu Lys Thr His His Val Gln Asp Ser Lys Met Gly Phe Val 385 390 395 400 Ile Asn Ala Ile Tyr Ser Met Ala Tyr Gly Leu His Asn Met Gln Met 405 410 415 Ser Leu Cys Pro Gly Tyr Ala Gly Leu Cys Asp Ala Met Lys Pro Ile 420 425 430 Asp Gly Arg Lys Leu Leu Glu Ser Leu Met Lys Thr Asn Phe Thr Gly 435 440 445 Val Ser Gly Asp Thr Ile Leu Phe Asp Glu Asn Gly Asp Ser Pro Gly 450 455 460 Arg Tyr Glu Ile Met Asn Phe Lys Glu Met Gly Lys Asp Tyr Phe Asp 465 470 475 480 Tyr Ile Asn Val Gly Ser Trp Asp Asn Gly Glu Leu Lys Met Asp Asp 485 490 495 Asp Glu Val Trp Ser Lys Lys Ser Asn Ile Ile Arg Ser Val Cys Ser 500 505 510 Glu Pro Cys Glu Lys Gly Gln Ile Lys Val Ile Arg Lys Gly Glu Val 515 520 525 Ser Cys Cys Trp Thr Cys Thr Pro Cys Lys Glu Asn Glu Tyr Val Phe 530 535 540 Asp Glu Tyr Thr Cys Lys Ala Cys Gln Leu Gly Ser Trp Pro Thr Asp 545 550 555 560 Asp Leu Thr Gly Cys Asp Leu Ile Pro Val Gln Tyr Leu Arg Trp Gly 565 570 575 Asp Pro Glu Pro Ile Ala Ala Val Val Phe Ala Cys Leu Gly Leu Leu 580 585 590 Ala Thr Leu Phe Val Thr Val Val Phe Ile Ile Tyr Arg Asp Thr Pro 595 600 605 Val Val Lys Ser Ser Ser Arg Glu Leu Cys Tyr Ile Ile Leu Ala Gly 610 615 620 Ile Cys Leu Gly Tyr Leu Cys Thr Phe Cys Leu Ile Ala Lys Pro Lys 625 630 635 640 Gln Ile Tyr Cys Tyr Leu Gln Arg Ile Gly Ile Gly Leu Ser Pro Ala 645 650 655 Met Ser Tyr Ser Ala Leu Val Thr Lys Thr Asn Arg Ile Ala Arg Ile 660 665 670 Leu Ala Gly Ser Lys Lys Lys Ile Cys Thr Lys Lys Pro Arg Phe Met 675 680 685 Ser Ala Cys Ala Gln Leu Val Ile Ala Phe Ile Leu Ile Cys Ile Gln 690 695 700 Leu Gly Ile Ile Val Ala Leu Phe Ile Met Glu Pro Pro Asp Ile Met 705 710 715 720 His Asp Tyr Pro Ser Ile Arg Glu Val Tyr Leu Ile Cys Asn Thr Thr 725 730 735 Asn Leu Gly Val Val Thr Pro Leu Gly Tyr Asn Gly Leu Leu Ile Leu 740 745 750 Ser Cys Thr Phe Tyr Ala Phe Lys Thr Arg Asn Val Pro Ala Asn Phe 755 760 765 Asn Glu Ala Lys Tyr Ile Ala Phe Thr Met Tyr Thr Thr Cys Ile Ile 770 775 780 Trp Leu Ala Phe Val Pro Ile Tyr Phe Gly Ser Asn Tyr Lys Ile Ile 785 790 795 800 Thr Met Cys Phe Ser Val Ser Leu Ser Ala Thr Val Ala Leu Gly Cys 805 810 815 Met Phe Val Pro Lys Val Tyr Ile Ile Leu Ala Lys Pro Glu Arg Asn 820 825 830 Val Arg Ser Ala Phe Thr Thr Ser Thr Val Val Arg Met His Val Gly 835 840 845 Asp Gly Lys Ser Ser Ser Ala Ala Ser Arg Ser Ser Ser Leu Val Asn 850 855 860 Leu Trp Lys Arg Arg Gly Ser Ser Gly Glu Thr Leu Ser Ser Asn Gly 865 870 875 880 Lys Ser Val Thr Trp Ala Gln Asn Glu Lys Ser Ser Arg Gly Gln His 885 890 895 Leu Trp Gln Arg Leu Ser Ile His Ile Asn Lys Lys Glu Asn Pro Asn 900 905 910 Gln Thr Ala Val Ile Lys Pro Phe Pro Lys Ser Thr Glu Ser Arg Gly 915 920 925 Leu Gly Ala Gly Ala Gly Ala Gly Gly Ser Ala Gly Gly Val Gly Ala 930 935 940 Thr Gly Gly Ala Gly Cys Ala Gly Ala Gly Pro Gly Gly Pro Glu Ser 945 950 955 960 Pro Asp Ala Gly Pro Lys Ala Leu Tyr Asp Val Ala Glu Ala Glu Glu 965 970 975 His Phe Pro Ala Pro Ala Arg Pro Arg Ser Pro Ser Pro Ile Ser Thr 980 985 990 Leu Ser His Arg Ala Gly Ser Ala Ser Arg Thr Asp Asp Asp Val Pro 995 1000 1005 Ser Leu His Ser Glu Pro Val Ala Arg Ser Ser Ser Ser Gln Gly Ser 1010 1015 1020 Leu Met Glu Gln Ile Ser Ser Val Val Thr Arg Phe Thr Ala Asn Ile 1025 1030 1035 1040 Ser Glu Leu Asn Ser Met Met Leu Ser Thr Ala Ala Pro Ser Pro Gly 1045 1050 1055 Val Gly Ala Pro Leu Cys Ser Ser Tyr Leu Ile Pro Lys Glu Ile Gln 1060 1065 1070 Leu Pro Thr Thr Met Thr Thr Phe Ala Glu Ile Gln Pro Leu Pro Ala 1075 1080 1085 Ile Glu Val Thr Gly Gly Ala Gln Pro Ala Ala Gly Ala Gln Ala Ala 1090 1095 1100 Gly Asp Ala Ala Arg Glu Ser Pro Ala Ala Gly Pro Glu Ala Ala Ala 1105 1110 1115 1120 Ala Lys Pro Asp Leu Glu Glu Leu Val Ala Leu Thr Pro Pro Ser Pro 1125 1130 1135 Phe Arg Asp Ser Val Asp Ser Gly Ser Thr Thr Pro Asn Ser Pro Val 1140 1145 1150 Ser Glu Ser Ala Leu Cys Ile Pro Ser Ser Pro Lys Tyr Asp Thr Leu 1155 1160 1165 Ile Ile Arg Asp Tyr Thr Gln Ser Ser Ser Ser Leu 1170 1175 1180 17 2621 DNA Homo sapiens 17 ccatgggatc gctgcttgcg ctcctggcac tgctgccgct gtggggtgct gtggctgagg 60 gcccagccaa gaaggtgctg accctggagg gagacttggt gctgggtggg ctgttcccag 120 tgcaccagaa gggcggccca gcagaggact gtggtcctgt caatgagcac cgtggcatcc 180 agcgcctgga ggccatgctt tttgcactgg accgcatcaa ccgtgacccg cacctgctgc 240 ctggcgtgcg cctgggtgca cacatcctcg acagttgctc caaggacaca catgcgctgg 300 agcaggcact ggactttgtg cgtgcctcac tcagccgtgg tgctgatgga tcacgccaca 360 tctgccccga cggctcttat gcgacccatg gtgatgctcc cactgccatc actggtgtta 420 ttggcggttc ctacagtgat gtctccatcc aggtggccaa cctcttgagg ctatttcaga 480 tcccacagat tagctacgcc tctaccagtg ccaagctgag tgacaagtcc cgctatgact 540 actttgcccg cacagtgcct cctgacttct tccaagccaa ggccatggct gagattctcc 600 gcttcttcaa ctggacctat gtgtccactg aggcctctga gggcgactat ggcgagacag 660 gcattgaggc ctttgagcta gaggctcgtg cccgcaacat ctgtgtggcc acctcggaga 720 aagtgggccg tgccatgagc cgcgcggcct ttgagggtgt ggtgcgagcc ctgctgcaga 780 agcccagtgc ccgcgtggct gtcctgttca cccgttctga ggatgcccgg gagctgcttg 840 ctgccagcca gcgcctcaat gccagcttca cctgggtggc cagtgatggt tggggggccc 900 tggagagtgt ggtggcaggc agtgaggggg ctgctgaggg tgctatcacc atcgagctgg 960 cctcctaccc catcagtgac tttgcctcct acttccagag cctggaccct tggaacaaca 1020 gccggaaccc ctggttccgt gaattctggg agcagaggtt ccgctgcagc ttccggcagc 1080 gagactgcgc agcccactct ctccgggctg tgccctttga acaggagtcc aagatcatgt 1140 ttgtggtcaa tgcagtgtac gccatggccc atgcgctcca caacatgcac cgtgccctct 1200 gccccaacac cacccggctc tgtgacgcga tgcggccagt taacgggcgc cgcctctaca 1260 aggactttgt gctcaacgtc aagtttgatg ccccctttcg cccagctgac acccacaatg 1320 aggtccgctt tgaccgcttt ggtgatggta ttggccgcta caacatcttc acctatctgc 1380 gtgcaggcag tgggcgctat cgctaccaga aggtgggcta ctgggcagaa ggcttgactc 1440 tggacaccag cctcatccca tgggcctcac cgtcagccgg ccccctggcc gcctctcgct 1500 gcagtgagcc ctgcctccag aatgaggtga agagtgtgca gccgggcgaa gtctgctgct 1560 ggctctgcat tccgtgccag ccctatgagt accgattgga cgaattcact tgcgctgatt 1620 gtggcctggg ctactggccc aatgccagcc tgactggctg cttcgaactg ccccaggagt 1680 acatccgctg gggcgatgcc tgggctgtgg gacctgtcac catcgcctgc ctcggtgccc 1740 tggccaccct gtttgtgctg ggtgtctttg tgcggcacaa tgccacacca gtggtcaagg 1800 cctcaggtcg ggagctctgc tacatcctgc tgggtggtgt cttcctctgc tactgcatga 1860 ccttcatctt cattgccaag ccatccacgg cagtgtgtac cttacggcgt cttggtttgg 1920 gcactgcctt ctctgtctgc tactcagccc tgctcaccaa gaccaaccgc attgcacgca 1980 tcttcggtgg ggcccgggag ggtgcccagc ggccacgctt catcagtcct gcctcacagg 2040 tggccatctg cctggcactt atctcgggcc agctgctcat cgtggtcgcc tggctggtgg 2100 tggaggcacc gggcacaggc aaggagacag cccccgaacg gcgggaggtg gtgacactgc 2160 gctgcaacca ccgcgatgca agtatgttgg gctcgctggc ctacaatgtg ctcctcatcg 2220 cgctctgcac gctttatgcc ttcaatactc gcaagtgccc cgaaaacttc aacgaggcca 2280 agttcattgg cttcaccatg tacaccacct gcatcatctg gctggcattg ttgcccatct 2340 tctatgtcac ctccagtgac taccgggtac agaccaccac catgtgcgtg tcagtcagcc 2400 tcagcggctc cgtggtgctt ggctgcctct ttgcgcccaa gctgcacatc atcctcttcc 2460 agccgcagaa gaacgtggtt agccaccggg cacccaccag ccgctttggc agtgctgctg 2520 ccagggccag ctccagcctt ggccaagggt ctggctccca gtttgtcccc actgtttgca 2580 atggccgtga ggtggtggac tcgacaacgt catcgctttg a 2621 18 872 PRT Homo sapiens 18 Met Gly Ser Leu Leu Ala Leu Leu Ala Leu Leu Pro Leu Trp Gly Ala 1 5 10 15 Val Ala Glu Gly Pro Ala Lys Lys Val Leu Thr Leu Glu Gly Asp Leu 20 25 30 Val Leu Gly Gly Leu Phe Pro Val His Gln Lys Gly Gly Pro Ala Glu 35 40 45 Asp Cys Gly Pro Val Asn Glu His Arg Gly Ile Gln Arg Leu Glu Ala 50 55 60 Met Leu Phe Ala Leu Asp Arg Ile Asn Arg Asp Pro His Leu Leu Pro 65 70 75 80 Gly Val Arg Leu Gly Ala His Ile Leu Asp Ser Cys Ser Lys Asp Thr 85 90 95 His Ala Leu Glu Gln Ala Leu Asp Phe Val Arg Ala Ser Leu Ser Arg 100 105 110 Gly Ala Asp Gly Ser Arg His Ile Cys Pro Asp Gly Ser Tyr Ala Thr 115 120 125 His Gly Asp Ala Pro Thr Ala Ile Thr Gly Val Ile Gly Gly Ser Tyr 130 135 140 Ser Asp Val Ser Ile Gln Val Ala Asn Leu Leu Arg Leu Phe Gln Ile 145 150 155 160 Pro Gln Ile Ser Tyr Ala Ser Thr Ser Ala Lys Leu Ser Asp Lys Ser 165 170 175 Arg Tyr Asp Tyr Phe Ala Arg Thr Val Pro Pro Asp Phe Phe Gln Ala 180 185 190 Lys Ala Met Ala Glu Ile Leu Arg Phe Phe Asn Trp Thr Tyr Val Ser 195 200 205 Thr Glu Ala Ser Glu Gly Asp Tyr Gly Glu Thr Gly Ile Glu Ala Phe 210 215 220 Glu Leu Glu Ala Arg Ala Arg Asn Ile Cys Val Ala Thr Ser Glu Lys 225 230 235 240 Val Gly Arg Ala Met Ser Arg Ala Ala Phe Glu Gly Val Val Arg Ala 245 250 255 Leu Leu Gln Lys Pro Ser Ala Arg Val Ala Val Leu Phe Thr Arg Ser 260 265 270 Glu Asp Ala Arg Glu Leu Leu Ala Ala Ser Gln Arg Leu Asn Ala Ser 275 280 285 Phe Thr Trp Val Ala Ser Asp Gly Trp Gly Ala Leu Glu Ser Val Val 290 295 300 Ala Gly Ser Glu Gly Ala Ala Glu Gly Ala Ile Thr Ile Glu Leu Ala 305 310 315 320 Ser Tyr Pro Ile Ser Asp Phe Ala Ser Tyr Phe Gln Ser Leu Asp Pro 325 330 335 Trp Asn Asn Ser Arg Asn Pro Trp Phe Arg Glu Phe Trp Glu Gln Arg 340 345 350 Phe Arg Cys Ser Phe Arg Gln Arg Asp Cys Ala Ala His Ser Leu Arg 355 360 365 Ala Val Pro Phe Glu Gln Glu Ser Lys Ile Met Phe Val Val Asn Ala 370 375 380 Val Tyr Ala Met Ala His Ala Leu His Asn Met His Arg Ala Leu Cys 385 390 395 400 Pro Asn Thr Thr Arg Leu Cys Asp Ala Met Arg Pro Val Asn Gly Arg 405 410 415 Arg Leu Tyr Lys Asp Phe Val Leu Asn Val Lys Phe Asp Ala Pro Phe 420 425 430 Arg Pro Ala Asp Thr His Asn Glu Val Arg Phe Asp Arg Phe Gly Asp 435 440 445 Gly Ile Gly Arg Tyr Asn Ile Phe Thr Tyr Leu Arg Ala Gly Ser Gly 450 455 460 Arg Tyr Arg Tyr Gln Lys Val Gly Tyr Trp Ala Glu Gly Leu Thr Leu 465 470 475 480 Asp Thr Ser Leu Ile Pro Trp Ala Ser Pro Ser Ala Gly Pro Leu Ala 485 490 495 Ala Ser Arg Cys Ser Glu Pro Cys Leu Gln Asn Glu Val Lys Ser Val 500 505 510 Gln Pro Gly Glu Val Cys Cys Trp Leu Cys Ile Pro Cys Gln Pro Tyr 515 520 525 Glu Tyr Arg Leu Asp Glu Phe Thr Cys Ala Asp Cys Gly Leu Gly Tyr 530 535 540 Trp Pro Asn Ala Ser Leu Thr Gly Cys Phe Glu Leu Pro Gln Glu Tyr 545 550 555 560 Ile Arg Trp Gly Asp Ala Trp Ala Val Gly Pro Val Thr Ile Ala Cys 565 570 575 Leu Gly Ala Leu Ala Thr Leu Phe Val Leu Gly Val Phe Val Arg His 580 585 590 Asn Ala Thr Pro Val Val Lys Ala Ser Gly Arg Glu Leu Cys Tyr Ile 595 600 605 Leu Leu Gly Gly Val Phe Leu Cys Tyr Cys Met Thr Phe Ile Phe Ile 610 615 620 Ala Lys Pro Ser Thr Ala Val Cys Thr Leu Arg Arg Leu Gly Leu Gly 625 630 635 640 Thr Ala Phe Ser Val Cys Tyr Ser Ala Leu Leu Thr Lys Thr Asn Arg 645 650 655 Ile Ala Arg Ile Phe Gly Gly Ala Arg Glu Gly Ala Gln Arg Pro Arg 660 665 670 Phe Ile Ser Pro Ala Ser Gln Val Ala Ile Cys Leu Ala Leu Ile Ser 675 680 685 Gly Gln Leu Leu Ile Val Val Ala Trp Leu Val Val Glu Ala Pro Gly 690 695 700 Thr Gly Lys Glu Thr Ala Pro Glu Arg Arg Glu Val Val Thr Leu Arg 705 710 715 720 Cys Asn His Arg Asp Ala Ser Met Leu Gly Ser Leu Ala Tyr Asn Val 725 730 735 Leu Leu Ile Ala Leu Cys Thr Leu Tyr Ala Phe Asn Thr Arg Lys Cys 740 745 750 Pro Glu Asn Phe Asn Glu Ala Lys Phe Ile Gly Phe Thr Met Tyr Thr 755 760 765 Thr Cys Ile Ile Trp Leu Ala Leu Leu Pro Ile Phe Tyr Val Thr Ser 770 775 780 Ser Asp Tyr Arg Val Gln Thr Thr Thr Met Cys Val Ser Val Ser Leu 785 790 795 800 Ser Gly Ser Val Val Leu Gly Cys Leu Phe Ala Pro Lys Leu His Ile 805 810 815 Ile Leu Phe Gln Pro Gln Lys Asn Val Val Ser His Arg Ala Pro Thr 820 825 830 Ser Arg Phe Gly Ser Ala Ala Ala Arg Ala Ser Ser Ser Leu Gly Gln 835 840 845 Gly Ser Gly Ser Gln Phe Val Pro Thr Val Cys Asn Gly Arg Glu Val 850 855 860 Val Asp Ser Thr Thr Ser Ser Leu 865 870 19 4260 DNA Homo sapiens 19 gtgcagttga gtcgcgagta cggctgagct gcgtaccggc ctccctggct ctcacactcc 60 ctctctgctc ccgctctcct aatctcctct ggcatgcggt cagccccctg cccagggacc 120 acaggagagt tcttgtaagg actgttagtc cctgcttacc tgaaagccaa gcgctctagc 180 agagctttaa agttggagcc gccaccctcc ctaccgcccc atgccccttc accccactcc 240 gaaattcacc gacctttgca tgcactgcct aaggatttca gagtgaggca aagcagtcgg 300 caaatctacc ctggcttttc gtataaaaat cctctcgtct aggtaccctg gctcactgaa 360 gactctgcag atataccctt ataagaggga gggtggggga gggaaaagaa cgagagaggg 420 aggaaagaat gaaaaggaga ggatgccagg aggtccgtgc ttctgccaag agtcccaatt 480 agatgcgacg gcttcagcct ggtcaaggtg aaggaaagtt gcttccgcgc ctaggaagtg 540 ggtttgcctg ataagagaag gaggagggga ctcggctggg aagagctccc ctcccctccg 600 cggaagacca ctgggtcccc tctttcccca acctcctccc tctcttctac tccacccctc 660 cgttttccca ctccccactg actcggatgc ctggatgttc tgccaccggg cagtggtcca 720 gcgtgcagcc gggagggggc aggggcaggg ggcactgtga caggaagctg cgcgcacaag 780 ttggccattt cgagggcaaa ataagttctc ccttggattt ggaaaggaca aagccagtaa 840 gctacctctt ttgtgtcgga tgaggaggac caaccatgag ccagagcccg ggtgcaggct 900 caccgccgcc gctgccaccg cggtcagctc cagttcctgc caggagttgt cggtgcgagg 960 aattttgtga caggctctgt tagtctgttc ctcccttatt tgaaggacag gccaaagatc 1020 cagtttggaa atgagagagg actagcatga cacattggct ccaccattga tatctcccag 1080 aggtacagaa acaggattca tgaagatgtt gacaagactg caagttctta ccttagcttt 1140 gttttcaaag ggatttttac tctctttagg ggaccataac tttctaagga gagagattaa 1200 aatagaaggt gaccttgttt tagggggcct gtttcctatt aacgaaaaag gcactggaac 1260 tgaagaatgt gggcgaatca atgaagaccg agggattcaa cgcctggaag ccatgttgtt 1320 tgctattgat gaaatcaaca aagatgatta cttgctacca ggagtgaagt tgggtgttca 1380 cattttggat acatgttcaa gggataccta tgcattggag caatcactgg agtttgtcag 1440 ggcatctttg acaaaagtgg atgaagctga gtatatgtgt cctgatggat cctatgccat 1500 tcaagaaaac atcccacttc tcattgcagg ggtcattggt ggctcttata gcagtgtttc 1560 catacaggtg gcaaacctgc tgcggctctt ccagatccct cagatcagct acgcatccac 1620 cagcgccaaa ctcagtgata agtcgcgcta tgattacttt gccaggaccg tgccccccga 1680 cttctaccag gccaaagcca tggctgagat cttgcgcttc ttcaactgga cctacgtgtc 1740 cacagtagcc tccgagggtg attacgggga gacagggatc gaggccttcg agcaggaagc 1800 ccgcctgcgc aacatctgca tcgctacggc ggagaaggtg ggccgctcca acatccgcaa 1860 gtcctacgac agcgtgatcc gagaactgtt gcagaagccc aacgcgcgcg tcgtggtcct 1920 cttcatgcgc agcgacgact cgcgggagct cattgcagcc gccagccgcg ccaatgcctc 1980 cttcacctgg gtggccagcg acggctgggg cgcgcaggag agcatcatca agggcagcga 2040 gcatgtggcc tacggcgcca tcaccctgga gctggcctcc cagcctgtcc gccagttcga 2100 ccgctacttc cagagcctca acccctacaa caaccaccgc aacccctggt tccgggactt 2160 ctgggagcaa aagtttcagt gcagcctcca gaacaaacgc aaccacaggc gcgtctgcga 2220 caagcacctg gccatcgaca gcagcaacta cgagcaagag tccaagatca tgtttgtggt 2280 gaacgcggtg tatgccatgg cccacgcttt gcacaaaatg cagcgcaccc tctgtcccaa 2340 cactaccaag ctttgtgatg ctatgaagat cctggatggg aagaagttgt acaaggatta 2400 cttgctgaaa atcaacttca cggctccatt caacccaaat aaagatgcag atagcatagt 2460 caagtttgac acttttggag atggaatggg gcgatacaac gtgttcaatt tccaaaatgt 2520 aggtggaaag tattcctact tgaaagttgg tcactgggca gaaaccttat cgctagatgt 2580 caactctatc cactggtccc ggaactcagt ccccacttcc cagtgcagcg acccctgtgc 2640 ccccaatgaa atgaagaata tgcaaccagg ggatgtctgc tgctggattt gcatcccctg 2700 tgaaccctac gaatacctgg ctgatgagtt tacctgtatg gattgtgggt ctggacagtg 2760 gcccactgca gacctaactg gatgctatga ccttcctgag gactacatca ggtgggaaga 2820 cgcctgggcc attggcccag tcaccattgc ctgtctgggt tttatgtgta catgcatggt 2880 tgtaactgtt tttatcaagc acaacaacac acccttggtc aaagcatcgg gccgagaact 2940 ctgctacatc ttattgtttg gggttggcct gtcatactgc atgacattct tcttcattgc 3000 caagccatca ccagtcatct gtgcattgcg ccgactcggg ctggggagtt ccttcgctat 3060 ctgttactca gccctgctga ccaagacaaa ctgcattgcc cgcatcttcg atggggtcaa 3120 gaatggcgct cagaggccaa aattcatcag ccccagttct caggttttca tctgcctggg 3180 tctgatcctg gtgcaaattg tgatggtgtc tgtgtggctc atcctggagg ccccaggcac 3240 caggaggtat acccttgcag agaagcggga aacagtcatc ctaaaatgca atgtcaaaga 3300 ttccagcatg ttgatctctc ttacctacga tgtgatcctg gtgatcttat gcactgtgta 3360 cgccttcaaa acgcggaagt gcccagaaaa tttcaacgaa gctaagttca taggttttac 3420 catgtacacc acgtgcatca tctggttggc cttcctccct atattttatg tgacatcaag 3480 tgactacaga gtgcagacga caaccatgtg catctctgtc agcctgagtg gctttgtggt 3540 cttgggctgt ttgtttgcac ccaaggttca catcatcctg tttcaacccc agaagaatgt 3600 tgtcacacac agactgcacc tcaacaggtt cagtgtcagt ggaactggga ccacatactc 3660 tcagtcctct gcaagcacgt atgtgccaac ggtgtgcaat gggcgggaag tcctcgactc 3720 caccacctca tctctgtgat tgtgaattgc agttcagttc ttgtgttttt agactgttag 3780 acaaaagtgc tcacgtgcag ctccagaata tggaaacaga gcaaaagaac aaccctagta 3840 ccttttttta gaaacagtac gataaattat ttttgaggac tgtatatagt gatgtgctag 3900 aactttctag gctgagtcta gtgcccctat tattaacaat tcccccagaa catggaaata 3960 accattgttt acagagctga gcattggtga cagggtctga catggtcagt ctactaaaaa 4020 acaaaaaaaa aaaacaaaaa aaaaaaaaca aaagaaaaaa ataaaaatac ggtggcaata 4080 ttatgtaacc ttttttccta tgaagttttt tgtaggtcct tgttgtaact aatttaggat 4140 gagtttctat gttgtatatt aaagttacat tatgtgtaac agattgattt tctcagcaca 4200 aaataaaaag catctgtatt aatgtaaaga tactgagaat aaaaccttca aggttttcca 4260 20 879 PRT Homo sapiens 20 Met Lys Met Leu Thr Arg Leu Gln Val Leu Thr Leu Ala Leu Phe Ser 1 5 10 15 Lys Gly Phe Leu Leu Ser Leu Gly Asp His Asn Phe Leu Arg Arg Glu 20 25 30 Ile Lys Ile Glu Gly Asp Leu Val Leu Gly Gly Leu Phe Pro Ile Asn 35 40 45 Glu Lys Gly Thr Gly Thr Glu Glu Cys Gly Arg Ile Asn Glu Asp Arg 50 55 60 Gly Ile Gln Arg Leu Glu Ala Met Leu Phe Ala Ile Asp Glu Ile Asn 65 70 75 80 Lys Asp Asp Tyr Leu Leu Pro Gly Val Lys Leu Gly Val His Ile Leu 85 90 95 Asp Thr Cys Ser Arg Asp Thr Tyr Ala Leu Glu Gln Ser Leu Glu Phe 100 105 110 Val Arg Ala Ser Leu Thr Lys Val Asp Glu Ala Glu Tyr Met Cys Pro 115 120 125 Asp Gly Ser Tyr Ala Ile Gln Glu Asn Ile Pro Leu Leu Ile Ala Gly 130 135 140 Val Ile Gly Gly Ser Tyr Ser Ser Val Ser Ile Gln Val Ala Asn Leu 145 150 155 160 Leu Arg Leu Phe Gln Ile Pro Gln Ile Ser Tyr Ala Ser Thr Ser Ala 165 170 175 Lys Leu Ser Asp Lys Ser Arg Tyr Asp Tyr Phe Ala Arg Thr Val Pro 180 185 190 Pro Asp Phe Tyr Gln Ala Lys Ala Met Ala Glu Ile Leu Arg Phe Phe 195 200 205 Asn Trp Thr Tyr Val Ser Thr Val Ala Ser Glu Gly Asp Tyr Gly Glu 210 215 220 Thr Gly Ile Glu Ala Phe Glu Gln Glu Ala Arg Leu Arg Asn Ile Cys 225 230 235 240 Ile Ala Thr Ala Glu Lys Val Gly Arg Ser Asn Ile Arg Lys Ser Tyr 245 250 255 Asp Ser Val Ile Arg Glu Leu Leu Gln Lys Pro Asn Ala Arg Val Val 260 265 270 Val Leu Phe Met Arg Ser Asp Asp Ser Arg Glu Leu Ile Ala Ala Ala 275 280 285 Ser Arg Ala Asn Ala Ser Phe Thr Trp Val Ala Ser Asp Gly Trp Gly 290 295 300 Ala Gln Glu Ser Ile Ile Lys Gly Ser Glu His Val Ala Tyr Gly Ala 305 310 315 320 Ile Thr Leu Glu Leu Ala Ser Gln Pro Val Arg Gln Phe Asp Arg Tyr 325 330 335 Phe Gln Ser Leu Asn Pro Tyr Asn Asn His Arg Asn Pro Trp Phe Arg 340 345 350 Asp Phe Trp Glu Gln Lys Phe Gln Cys Ser Leu Gln Asn Lys Arg Asn 355 360 365 His Arg Arg Val Cys Asp Lys His Leu Ala Ile Asp Ser Ser Asn Tyr 370 375 380 Glu Gln Glu Ser Lys Ile Met Phe Val Val Asn Ala Val Tyr Ala Met 385 390 395 400 Ala His Ala Leu His Lys Met Gln Arg Thr Leu Cys Pro Asn Thr Thr 405 410 415 Lys Leu Cys Asp Ala Met Lys Ile Leu Asp Gly Lys Lys Leu Tyr Lys 420 425 430 Asp Tyr Leu Leu Lys Ile Asn Phe Thr Ala Pro Phe Asn Pro Asn Lys 435 440 445 Asp Ala Asp Ser Ile Val Lys Phe Asp Thr Phe Gly Asp Gly Met Gly 450 455 460 Arg Tyr Asn Val Phe Asn Phe Gln Asn Val Gly Gly Lys Tyr Ser Tyr 465 470 475 480 Leu Lys Val Gly His Trp Ala Glu Thr Leu Ser Leu Asp Val Asn Ser 485 490 495 Ile His Trp Ser Arg Asn Ser Val Pro Thr Ser Gln Cys Ser Asp Pro 500 505 510 Cys Ala Pro Asn Glu Met Lys Asn Met Gln Pro Gly Asp Val Cys Cys 515 520 525 Trp Ile Cys Ile Pro Cys Glu Pro Tyr Glu Tyr Leu Ala Asp Glu Phe 530 535 540 Thr Cys Met Asp Cys Gly Ser Gly Gln Trp Pro Thr Ala Asp Leu Thr 545 550 555 560 Gly Cys Tyr Asp Leu Pro Glu Asp Tyr Ile Arg Trp Glu Asp Ala Trp 565 570 575 Ala Ile Gly Pro Val Thr Ile Ala Cys Leu Gly Phe Met Cys Thr Cys 580 585 590 Met Val Val Thr Val Phe Ile Lys His Asn Asn Thr Pro Leu Val Lys 595 600 605 Ala Ser Gly Arg Glu Leu Cys Tyr Ile Leu Leu Phe Gly Val Gly Leu 610 615 620 Ser Tyr Cys Met Thr Phe Phe Phe Ile Ala Lys Pro Ser Pro Val Ile 625 630 635 640 Cys Ala Leu Arg Arg Leu Gly Leu Gly Ser Ser Phe Ala Ile Cys Tyr 645 650 655 Ser Ala Leu Leu Thr Lys Thr Asn Cys Ile Ala Arg Ile Phe Asp Gly 660 665 670 Val Lys Asn Gly Ala Gln Arg Pro Lys Phe Ile Ser Pro Ser Ser Gln 675 680 685 Val Phe Ile Cys Leu Gly Leu Ile Leu Val Gln Ile Val Met Val Ser 690 695 700 Val Trp Leu Ile Leu Glu Ala Pro Gly Thr Arg Arg Tyr Thr Leu Ala 705 710 715 720 Glu Lys Arg Glu Thr Val Ile Leu Lys Cys Asn Val Lys Asp Ser Ser 725 730 735 Met Leu Ile Ser Leu Thr Tyr Asp Val Ile Leu Val Ile Leu Cys Thr 740 745 750 Val Tyr Ala Phe Lys Thr Arg Lys Cys Pro Glu Asn Phe Asn Glu Ala 755 760 765 Lys Phe Ile Gly Phe Thr Met Tyr Thr Thr Cys Ile Ile Trp Leu Ala 770 775 780 Phe Leu Pro Ile Phe Tyr Val Thr Ser Ser Asp Tyr Arg Val Gln Thr 785 790 795 800 Thr Thr Met Cys Ile Ser Val Ser Leu Ser Gly Phe Val Val Leu Gly 805 810 815 Cys Leu Phe Ala Pro Lys Val His Ile Ile Leu Phe Gln Pro Gln Lys 820 825 830 Asn Val Val Thr His Arg Leu His Leu Asn Arg Phe Ser Val Ser Gly 835 840 845 Thr Gly Thr Thr Tyr Ser Gln Ser Ser Ala Ser Thr Tyr Val Pro Thr 850 855 860 Val Cys Asn Gly Arg Glu Val Leu Asp Ser Thr Thr Ser Ser Leu 865 870 875 21 3884 DNA Homo sapiens 21 ccgagtgaca aggaggtggg agagggtagc agcatgggct acgcggttgg ctgccctcag 60 tccccctgct gctgaagctg ccctgcccat gcccacccag gccgtggggc caggggcctg 120 ccagggctag gagtgggcct gccgttcatg ggtctctagg gatttccgag atgcctggga 180 agagaggctt gggctggtgg tgggcccggc tgcccctttg cctgctcctc agcctttacg 240 gcccctggat gccttcctcc ctgggaaagc ccaaaggcca ccctcacatg aattccatcc 300 gcatagatgg ggacatcaca ctgggaggcc tgttcccggt gcatggccgg ggctcagagg 360 gcaagccctg tggagaactt aagaaggaaa agggcatcca ccggctggag gccatgctgt 420 tcgccctgga tcgcatcaac aacgacccgg acctgctgcc taacatcacg ctgggcgccc 480 gcattctgga cacctgctcc agggacaccc atgccctcga gcagtcgctg acctttgtgc 540 aggcgctcat cgagaaggat ggcacagagg tccgctgtgg cagtggcggc ccacccatca 600 tcaccaagcc tgaacgtgtg gtgggtgtca tcggtgcttc agggagctcg gtctccatca 660 tggtggccaa catccttcgc ctcttcaaga taccccagat cagctacgcc tccacagcgc 720 cagacctgag tgacaacagc cgctacgact tcttctcccg cgtggtgccc tcggacacgt 780 accaggccca ggccatggtg gacatcgtcc gtgccctcaa gtggaactat gtgtccacag 840 tggcctcgga gggcagctat ggtgagagcg gtgtggaggc cttcatccag aagtcccgtg 900 aggacggggg cgtgtgcatc gcccagtcgg tgaagatacc acgggagccc aaggcaggcg 960 agttcgacaa gatcatccgc cgcctcctgg agacttcgaa cgccagggca gtcatcatct 1020 ttgccaacga ggatgacatc aggcgtgtgc tggaggcagc acgaagggcc aaccagacag 1080 gccatttctt ctggatgggc tctgacagct ggggctccaa gattgcacct gtgctgcacc 1140 tggaggaggt ggctgagggt gctgtcacga tcctccccaa gaggatgtcc gtacgaggct 1200 tcgaccgcta cttctccagc cgcacgctgg acaacaaccg gcgcaacatc tggtttgccg 1260 agttctggga ggacaacttc cactgcaagc tgagccgcca cgccctcaag aagggcagcc 1320 acgtcaagaa gtgcaccaac cgtgagcgaa ttgggcagga ttcagcttat gagcaggagg 1380 ggaaggtgca gtttgtgatc gatgccgtgt acgccatggg ccacgcgctg cacgccatgc 1440 accgtgacct gtgtcccggc cgcgtggggc tctgcccgcg catggaccct gtagatggca 1500 cccagctgct taagtacatc cgaaacgtca acttctcagg catcgcaggg aaccctgtga 1560 ccttcaatga gaatggagat gcgcctgggc gctatgacat ctaccaatac cagctgcgca 1620 acgattctgc cgagtacaag gtcattggct cctggactga ccacctgcac cttagaatag 1680 agcggatgca ctggccgggg agcgggcagc agctgccccg ctccatctgc agcctgccct 1740 gccaaccggg tgagcggaag aagacagtga agggcatgcc ttgctgctgg cactgcgagc 1800 cttgcacagg gtaccagtac caggtggacc gctacacctg taagacgtgt ccctatgaca 1860 tgcggcccac agagaaccgc acgggctgcc ggcccatccc catcatcaag cttgagtggg 1920 gctcgccctg ggccgtgctg cccctcttcc tggccgtggt gggcatcgct gccacgttgt 1980 tcgtggtgat cacctttgtg cgctacaacg acacgcccat cgtcaaggcc tcgggccgtg 2040 aactgagcta cgtgctgctg gcaggcatct tcctgtgcta tgccaccacc ttcctcatga 2100 tcgctgagcc cgaccttggc acctgctcgc tgcgccgaat cttcctggga ctagggatga 2160 gcatcagcta tgcagccctg ctcaccaaga ccaaccgcat ctaccgcatc ttcgagcagg 2220 gcaagcgctc ggtcagtgcc ccacgcttca tcagccccgc ctcacagctg gccatcacct 2280 tcagcctcat ctcgctgcag ctgctgggca tctgtgtgtg gtttgtggtg gacccctccc 2340 actcggtggt ggacttccag gaccagcgga cactcgaccc ccgcttcgcc aggggtgtgc 2400 tcaagtgtga catctcggac ctgtcgctca tctgcctgct gggctacagc atgctgctca 2460 tggtcacgtg caccgtgtat gccatcaaga cacgcggcgt gcccgagacc ttcaatgagg 2520 ccaagcccat tggcttcacc atgtacacca cttgcatcgt ctggctggcc ttcatcccca 2580 tcttctttgg cacctcgcag tcggccgaca agctgtacat ccagacgacg acgctgacgg 2640 tctcggtgag tctgagcgcc tcggtgtccc tgggaatgct ctacatgccc aaagtctaca 2700 tcatcctctt ccacccggag cagaacgtgc ccaagcgcaa gcgcagcctc aaagccgtcg 2760 ttacggcggc caccatgtcc aacaagttca cgcagaaggg caacttccgg cccaacggag 2820 aggccaagtc tgagctctgc gagaaccttg aggccccagc gctggccacc aaacagactt 2880 acgtcactta caccaaccat gcaatctagc gagtccatgg agctgagcag caggaggagg 2940 agccgtgacc ctgtggaagg tgcgtcgggc cagggccaca cccaagggcc cagctgtctt 3000 gcctgcccgt gggcacccac ggacgtggct tggtgctgag gatagcagag cccccagcca 3060 tcactgctgg cagcctgggc aaaccgggtg agcaacagga ggacgagggg ccggggcggt 3120 gccaggctac cacaagaacc tgcgtcttgg accattgccc ctcccggccc caaaccacag 3180 gggctcaggt cgtgtgggcc ccagtgctag atctctccct cccttcgtct ctgtctgtgc 3240 tgttggcgac ccctctgtct gtctccagcc ctgtctttct gttctcttat ctctttgttt 3300 caccttttcc ctctctggcg tccccggctg cttgtactct tggccttttc tgtgtctcct 3360 ttctggctct tgcctccgcc tctctctctc atcctctttg tcctcagctc ctcctgcttt 3420 cttgggtccc accagtgtca cttttctgcc gttttctttc ctgttctcct ctgcttcatt 3480 ctcgtccagc cattgctccc ctctccctgc cacccttccc cagttcacca aaccttacat 3540 gttgcaaaag agaaaaaagg aaaaaaaatc aaaacacaaa aaagccaaaa cgaaaacaaa 3600 tctcgagtgt gttgccaagt gctgcgtcct cctggtggcc tctgtgtgtg tccctgtggc 3660 ccgcagcctg cccgcctgcc ccgcccatct gccgtgtgtc ttgcccgcct gccccgcccg 3720 tctgccgtct gtcttgcccg cctgcccgcc tgcccctcct gccgaccaca cggagttcag 3780 tgcctgggtg tttggtgatg gttattgacg acaatgtgta gcgcatgatt gtttttatac 3840 caagaacatt tctaataaaa ataaacacat ggttttgcaa aaaa 3884 22 912 PRT Homo sapiens 22 Met Pro Gly Lys Arg Gly Leu Gly Trp Trp Trp Ala Arg Leu Pro Leu 1 5 10 15 Cys Leu Leu Leu Ser Leu Tyr Gly Pro Trp Met Pro Ser Ser Leu Gly 20 25 30 Lys Pro Lys Gly His Pro His Met Asn Ser Ile Arg Ile Asp Gly Asp 35 40 45 Ile Thr Leu Gly Gly Leu Phe Pro Val His Gly Arg Gly Ser Glu Gly 50 55 60 Lys Pro Cys Gly Glu Leu Lys Lys Glu Lys Gly Ile His Arg Leu Glu 65 70 75 80 Ala Met Leu Phe Ala Leu Asp Arg Ile Asn Asn Asp Pro Asp Leu Leu 85 90 95 Pro Asn Ile Thr Leu Gly Ala Arg Ile Leu Asp Thr Cys Ser Arg Asp 100 105 110 Thr His Ala Leu Glu Gln Ser Leu Thr Phe Val Gln Ala Leu Ile Glu 115 120 125 Lys Asp Gly Thr Glu Val Arg Cys Gly Ser Gly Gly Pro Pro Ile Ile 130 135 140 Thr Lys Pro Glu Arg Val Val Gly Val Ile Gly Ala Ser Gly Ser Ser 145 150 155 160 Val Ser Ile Met Val Ala Asn Ile Leu Arg Leu Phe Lys Ile Pro Gln 165 170 175 Ile Ser Tyr Ala Ser Thr Ala Pro Asp Leu Ser Asp Asn Ser Arg Tyr 180 185 190 Asp Phe Phe Ser Arg Val Val Pro Ser Asp Thr Tyr Gln Ala Gln Ala 195 200 205 Met Val Asp Ile Val Arg Ala Leu Lys Trp Asn Tyr Val Ser Thr Val 210 215 220 Ala Ser Glu Gly Ser Tyr Gly Glu Ser Gly Val Glu Ala Phe Ile Gln 225 230 235 240 Lys Ser Arg Glu Asp Gly Gly Val Cys Ile Ala Gln Ser Val Lys Ile 245 250 255 Pro Arg Glu Pro Lys Ala Gly Glu Phe Asp Lys Ile Ile Arg Arg Leu 260 265 270 Leu Glu Thr Ser Asn Ala Arg Ala Val Ile Ile Phe Ala Asn Glu Asp 275 280 285 Asp Ile Arg Arg Val Leu Glu Ala Ala Arg Arg Ala Asn Gln Thr Gly 290 295 300 His Phe Phe Trp Met Gly Ser Asp Ser Trp Gly Ser Lys Ile Ala Pro 305 310 315 320 Val Leu His Leu Glu Glu Val Ala Glu Gly Ala Val Thr Ile Leu Pro 325 330 335 Lys Arg Met Ser Val Arg Gly Phe Asp Arg Tyr Phe Ser Ser Arg Thr 340 345 350 Leu Asp Asn Asn Arg Arg Asn Ile Trp Phe Ala Glu Phe Trp Glu Asp 355 360 365 Asn Phe His Cys Lys Leu Ser Arg His Ala Leu Lys Lys Gly Ser His 370 375 380 Val Lys Lys Cys Thr Asn Arg Glu Arg Ile Gly Gln Asp Ser Ala Tyr 385 390 395 400 Glu Gln Glu Gly Lys Val Gln Phe Val Ile Asp Ala Val Tyr Ala Met 405 410 415 Gly His Ala Leu His Ala Met His Arg Asp Leu Cys Pro Gly Arg Val 420 425 430 Gly Leu Cys Pro Arg Met Asp Pro Val Asp Gly Thr Gln Leu Leu Lys 435 440 445 Tyr Ile Arg Asn Val Asn Phe Ser Gly Ile Ala Gly Asn Pro Val Thr 450 455 460 Phe Asn Glu Asn Gly Asp Ala Pro Gly Arg Tyr Asp Ile Tyr Gln Tyr 465 470 475 480 Gln Leu Arg Asn Asp Ser Ala Glu Tyr Lys Val Ile Gly Ser Trp Thr 485 490 495 Asp His Leu His Leu Arg Ile Glu Arg Met His Trp Pro Gly Ser Gly 500 505 510 Gln Gln Leu Pro Arg Ser Ile Cys Ser Leu Pro Cys Gln Pro Gly Glu 515 520 525 Arg Lys Lys Thr Val Lys Gly Met Pro Cys Cys Trp His Cys Glu Pro 530 535 540 Cys Thr Gly Tyr Gln Tyr Gln Val Asp Arg Tyr Thr Cys Lys Thr Cys 545 550 555 560 Pro Tyr Asp Met Arg Pro Thr Glu Asn Arg Thr Gly Cys Arg Pro Ile 565 570 575 Pro Ile Ile Lys Leu Glu Trp Gly Ser Pro Trp Ala Val Leu Pro Leu 580 585 590 Phe Leu Ala Val Val Gly Ile Ala Ala Thr Leu Phe Val Val Ile Thr 595 600 605 Phe Val Arg Tyr Asn Asp Thr Pro Ile Val Lys Ala Ser Gly Arg Glu 610 615 620 Leu Ser Tyr Val Leu Leu Ala Gly Ile Phe Leu Cys Tyr Ala Thr Thr 625 630 635 640 Phe Leu Met Ile Ala Glu Pro Asp Leu Gly Thr Cys Ser Leu Arg Arg 645 650 655 Ile Phe Leu Gly Leu Gly Met Ser Ile Ser Tyr Ala Ala Leu Leu Thr 660 665 670 Lys Thr Asn Arg Ile Tyr Arg Ile Phe Glu Gln Gly Lys Arg Ser Val 675 680 685 Ser Ala Pro Arg Phe Ile Ser Pro Ala Ser Gln Leu Ala Ile Thr Phe 690 695 700 Ser Leu Ile Ser Leu Gln Leu Leu Gly Ile Cys Val Trp Phe Val Val 705 710 715 720 Asp Pro Ser His Ser Val Val Asp Phe Gln Asp Gln Arg Thr Leu Asp 725 730 735 Pro Arg Phe Ala Arg Gly Val Leu Lys Cys Asp Ile Ser Asp Leu Ser 740 745 750 Leu Ile Cys Leu Leu Gly Tyr Ser Met Leu Leu Met Val Thr Cys Thr 755 760 765 Val Tyr Ala Ile Lys Thr Arg Gly Val Pro Glu Thr Phe Asn Glu Ala 770 775 780 Lys Pro Ile Gly Phe Thr Met Tyr Thr Thr Cys Ile Val Trp Leu Ala 785 790 795 800 Phe Ile Pro Ile Phe Phe Gly Thr Ser Gln Ser Ala Asp Lys Leu Tyr 805 810 815 Ile Gln Thr Thr Thr Leu Thr Val Ser Val Ser Leu Ser Ala Ser Val 820 825 830 Ser Leu Gly Met Leu Tyr Met Pro Lys Val Tyr Ile Ile Leu Phe His 835 840 845 Pro Glu Gln Asn Val Pro Lys Arg Lys Arg Ser Leu Lys Ala Val Val 850 855 860 Thr Ala Ala Thr Met Ser Asn Lys Phe Thr Gln Lys Gly Asn Phe Arg 865 870 875 880 Pro Asn Gly Glu Ala Lys Ser Glu Leu Cys Glu Asn Leu Glu Ala Pro 885 890 895 Ala Leu Ala Thr Lys Gln Thr Tyr Val Thr Tyr Thr Asn His Ala Ile 900 905 910 23 6122 DNA Homo sapiens 23 cggaggcccg ggcaggccgg ctgagctaac tccccagagc caaagtggaa ggcgcgcccc 60 gagcgccttc tccccaggac cccggtgtcc ctccccgcgc cccgagcccg cgctctcctt 120 cccccgccct cagagcgctc cccgcccctc tgtctccccg cagcccgcta gacgagccga 180 tggcgcggcc ccggagagcc cgggagccgc tgctcgtggc gctgctgccg ctggcgtggc 240 tggcgcaggc gggcctggcg cgcgcggcgg gctctgtgcg cctggcgggc ggcctgacgc 300 tgggcggcct gttcccggtg cacgcgcggg gcgcggcggg ccgggcgtgc gggccgctga 360 agaaggagca gggcgtgcac cggctggagg ccatgctgta cgcgctggac cgcgtcaacg 420 ccgaccccga gctgctgccc ggcgtgcgcc tgggcgcgcg gctgctggac acctgctcgc 480 gggacaccta cgcgctggag caggcgctga gcttcgtgca ggcgctgatc cgcggccgcg 540 gcgacggcga cgaggtgggc gtgcgctgcc cgggaggcgt ccctccgctg cgccccgcgc 600 cccccgagcg cgtcgtggcc gtcgtgggcg cctcggccag ctccgtctcc atcatggtcg 660 ccaacgtgct gcgcctgttt gcgatacccc agatcagcta tgcctccaca gccccggagc 720 tcagcgactc cacacgctat gacttcttct cccgggtggt gccacccgac tcctaccagg 780 cgcaggccat ggtggacatc gtgagggcac tgggatggaa ctatgtgtcc acgctggcct 840 ccgagggcaa ctatggcgaa agtggggttg aggccttcgt tcagatctcc cgagaggctg 900 ggggggtctg tattgcccag tctatcaaga ttcccaggga accaaagcca ggagagttca 960 gcaaggtgat caggagactc atggagacgc ccaacgcccg gggcatcatc atctttgcca 1020 atgaggatga catcaggcgg gtcctggagg cagctcgcca ggccaacctg accggccact 1080 tcctgtgggt cggctcagac agctggggag ccaagacctc acccatcttg agcctggagg 1140 acgtggccgt tggggccatc accatcctgc ccaaaagggc ctccatcgac ggatttgacc 1200 agtacttcat gactcgatcc ctggagaaca accgcaggaa catctggttc gccgagttct 1260 gggaagagaa ttttaactgc aaactgacca gctcaggtac ccagtcagat gattccaccc 1320 gcaaatgcac aggcgaggaa cgcatcggcc gggactccac ctacgagcag gagggcaagg 1380 tgcagtttgt gattgatgcg gtgtatgcca ttgcccacgc cctccacagc atgcaccagg 1440 cgctctgccc tgggcacaca ggcctgtgcc cggcgatgga acccaccgat gggcggatgc 1500 ttctgcagta cattcgagct gtccgcttca acggcagcgc aggaacccct gtgatgttca 1560 acgagaacgg ggatgcgccc gggcggtacg acatcttcca gtaccaggcg accaatggca 1620 gtgccagcag tggcgggtac caggcagtgg gccagtgggc agagaccctc agactggatg 1680 tggaggccct gcagtggtct ggcgaccccc acgaggtgcc ctcgtctctg tgcagcctgc 1740 cctgcgggcc gggggagcgg aagaagatgg tgaagggcgt cccctgctgt tggcactgcg 1800 aggcctgtga cgggtaccgc ttccaggtgg acgagttcac atgcgaggcc tgtcctgggg 1860 acatgaggcc cacgcccaac cacacgggct gccgccccac acctgtggtg cgcctgagct 1920 ggtcctcccc ctgggcagcc ccgccgctcc tcctggccgt gctgggcatc gtggccacta 1980 ccacggtggt ggccaccttc gtgcggtaca acaacacgcc catcgtccgg gcctcgggcc 2040 gagagctcag ctacgtcctc ctcaccggca tcttcctcat ctacgccatc accttcctca 2100 tggtggctga gcctggggcc gcggtctgtg ccgcccgcag gctcttcctg ggcctgggca 2160 cgaccctcag ctactctgcc ctgctcacca agaccaaccg tatctaccgc atctttgagc 2220 agggcaagcg ctcggtcaca ccccctccct tcatcagccc cacctcacag ctggtcatca 2280 ccttcagcct cacctccctg caggtggtgg ggatgatagc atggctgggg gcccggcccc 2340 cacacagcgt gattgactat gaggaacagc ggacagtgga ccccgagcag gccagagggg 2400 tgctcaagtg cgacatgtcg gatctgtctc tcatcggctg cctgggctac agcctcctgc 2460 tcatggtcac gtgcacagtg tacgccatca aggcccgtgg cgtgcccgag accttcaacg 2520 aggccaagcc catcggcttc accatgtaca ccacctgcat catctggctg gcattcgtgc 2580 ccatcttctt tggcactgcc cagtcagctg aaaagatcta catccagaca accacgctaa 2640 ccgtgtcctt gagcctgagt gcctcggtgt ccctcggcat gctctacgta cccaaaacct 2700 acgtcatcct cttccatcca gagcagaatg tgcagaagcg aaagcggagc ctcaaggcca 2760 cctccacggt ggcagcccca cccaagggcg aggatgcaga ggcccacaag tagcagggca 2820 ggtgggaacg ggactgcttg ctgcctctcc tttcttcctc ttgcctcgag gtggaagctg 2880 tatagagccc gggtccacgg tgaacagtca gtggcaggga gtttgccaag accatgctcc 2940 gcgtcggtgg ggctggcctt gagaaggaac tggacccagc tctaccccga ttccagcatg 3000 tgagcttcat gcttcctcac cacagaccag actcgcttcc catggtggga aacagccacc 3060 gagaaggttc tagctctaga aagggactaa acttattctc tcatccgaag tccaaagagg 3120 atgatgaagc cctgggcttt gcctggtttg cgggagattt cctcccctca gtcaaccccc 3180 ataacctggg gattgggcag tgtggaagaa cgtgtagacc ccagaatgaa acatggggtt 3240 ggagtggagg aggagctgtc tcagcaagag gagacctggg gctgtgcatc tggatggagg 3300 cactcaggcc tgggtaggat tcctctggca cggagggaga gaccctgggt gagacccctg 3360 tgagcatggg aagggcctgc agtgggcgcg ggagtgagct gaggaactgg ggtgcgcccc 3420 catgagattc ccaatgccat gggctttccc ccatcccccc gggattgggc aaggtcagac 3480 ttagagtaca gctgttttcc tcccctctgt gtactccctt aaatcacccc aaccttggcc 3540 aggcatggtg gctcacacct gtaatcccag cactttggga ggccgaggca ggtggatcac 3600 ctgaggtccg gagttcgaga ccagcctggc caatgtggtg aaaccctgtc tctactaaaa 3660 atacaaaaat tagccaggtg tgatggtggg tgcctgtaat cccagttact tgggaggctg 3720 aggcaggaga atcgcttgaa cctgggaggt ggaggttgca gtgagctgtg attgtgccac 3780 tgtactccag cctgggtgac agagcgagac tctgtctcaa aaaaacaaaa caaaaaaaca 3840 ccaaaaaaac ccccaaacct gaagaaattc agatacacgt gtgtaatgtt agtgatgtga 3900 gaacaaggag caggggtgca tttgtgttgt gttcgggttg gggatgggtt taggagctcc 3960 aggttgggag cagtgacaga gagtcatggc cgtggtgagg gtgaatccca agtggatggc 4020 tcaggacggg tatggaaacc cttcattcct cataggtact gggaagtcca tttgcaagct 4080 gagcgccagg cctggggagg aagaggcttg ggctgcagat gcacgcacat ttgtttttca 4140 ctgatagttt ttacaaaaag cttggtttaa gttatggaat tttatgtccc tgggagtaga 4200 atttacattt gttaaattga ccactgttta agatcagtat acattctcta gtctgtgatg 4260 tctggagcta gttttgaggg tgaaccacac tttatccaac atacaaactt tcccatgcag 4320 cttctctggt gcgcagttgg ttttgaccgt gggactaggt gcttctgcag gttttaagta 4380 attaacttaa aagcttctcc tctgagaaac atttctgttg cgctactgac tctccttctc 4440 cacatttgtt gtgttcctag ggcttctcta tagtgcacat taggacgttt catttgttgc 4500 tgaatgcttt ccagaattat ttattccata gggtttctct cctgtgcagc tctctcatgg 4560 gtaatggggc gtgttttctt gccaaaggcg gttccaccct cgtgattgta tagggctctt 4620 ctcctgtatg aactctgaga tcagtgagct ctgatctcca agggaaagtt ttcctgcatt 4680 tgctgttttc tcatgtctct cccagtgtga attctctggc ttctagctga aaacttttcc 4740 acagttttac attcatgtgg ttttctccac tgtgaactct gtgattcaga atcagaagca 4800 gttcttagta gaggcatttc tacactgatt gcactgagga tatctcccca gtgtgaagtt 4860 tctggcatag agtcctggct tcccgcagac gactttcaca ctctgccatg ttcatgcctg 4920 tgggcctctc tggcaggaac tctgatgcac cgcgaggccc atgtactcct gtggctttct 4980 cacattcggt ctacttgcag ggtatctcca cagcatgcac cattctgggt acagggggac 5040 atcctctgtt actgaagatg ttgtcatatt tagtaccttc acaaggtttc tctccttcca 5100 gaattttctg atgtacacaa ataactgact tccacaagag ggcttttcca cactcggtgt 5160 gtgcatacag tttctgcctg tgatcatttc tttatgttat tattttattt tttcgagata 5220 gggtcttgct caatttctta ggctggagtg cagtggcacg atcatagctc actgaagttt 5280 cgacctgggc tcaagcaatc ctcccgcttc agcctcctga gtagctggtg cgcacgacca 5340 tacccagcta atgttttatt ttttgtagag acgaggtctc actatgttgc ccaggctggt 5400 ctcgaacttc tgagctcgag cgatcctcct gcctccacct cccaaagtgt tcggattaca 5460 aacgtgagcc atcgcaccta gcctctttga tcatttctgt ggtgttcagt gggggttgac 5520 agctccctaa agattttcct gtttttttgc atgcatgggt ttgaattctt tgaggtccaa 5580 tttatttgga cccctgaata aagttttgtg ggttttcttc tatgtgtgga attatatagg 5640 cattcttcca gtgtggtttc tcttatgtcg agtgagagct gacctgcacc gaagtttgtc 5700 ccatttgttg cccttgaatt atctgtatga attatatgtt ccagtgaaaa tggagttctg 5760 ggttggaggc ttattccatg tttacacaat taaaattgca gtgttcctct ctgggatgag 5820 agctctaaag cagagtaaga ttacgttctg atgtaagctt taaccaccta tttataaggt 5880 ctcacctgtg gtccactgtg ttgagacttc tacagaagag cttctgtata gtaaccattt 5940 tcttaggctg tctcacttgt gtgaatcttc tgacacattt attatagctt tgtcccattt 6000 cttatccttt ttgctcttta gaaatttccc tttaatttat tacattcatt gcttactgta 6060 aagagtccag gtaactgact ttaattcaag ttacttcctg ttcaataaat ttaacttttc 6120 cc 6122 24 877 PRT Homo sapiens 24 Met Ala Arg Pro Arg Arg Ala Arg Glu Pro Leu Leu Val Ala Leu Leu 1 5 10 15 Pro Leu Ala Trp Leu Ala Gln Ala Gly Leu Ala Arg Ala Ala Gly Ser 20 25 30 Val Arg Leu Ala Gly Gly Leu Thr Leu Gly Gly Leu Phe Pro Val His 35 40 45 Ala Arg Gly Ala Ala Gly Arg Ala Cys Gly Pro Leu Lys Lys Glu Gln 50 55 60 Gly Val His Arg Leu Glu Ala Met Leu Tyr Ala Leu Asp Arg Val Asn 65 70 75 80 Ala Asp Pro Glu Leu Leu Pro Gly Val Arg Leu Gly Ala Arg Leu Leu 85 90 95 Asp Thr Cys Ser Arg Asp Thr Tyr Ala Leu Glu Gln Ala Leu Ser Phe 100 105 110 Val Gln Ala Leu Ile Arg Gly Arg Gly Asp Gly Asp Glu Val Gly Val 115 120 125 Arg Cys Pro Gly Gly Val Pro Pro Leu Arg Pro Ala Pro Pro Glu Arg 130 135 140 Val Val Ala Val Val Gly Ala Ser Ala Ser Ser Val Ser Ile Met Val 145 150 155 160 Ala Asn Val Leu Arg Leu Phe Ala Ile Pro Gln Ile Ser Tyr Ala Ser 165 170 175 Thr Ala Pro Glu Leu Ser Asp Ser Thr Arg Tyr Asp Phe Phe Ser Arg 180 185 190 Val Val Pro Pro Asp Ser Tyr Gln Ala Gln Ala Met Val Asp Ile Val 195 200 205 Arg Ala Leu Gly Trp Asn Tyr Val Ser Thr Leu Ala Ser Glu Gly Asn 210 215 220 Tyr Gly Glu Ser Gly Val Glu Ala Phe Val Gln Ile Ser Arg Glu Ala 225 230 235 240 Gly Gly Val Cys Ile Ala Gln Ser Ile Lys Ile Pro Arg Glu Pro Lys 245 250 255 Pro Gly Glu Phe Ser Lys Val Ile Arg Arg Leu Met Glu Thr Pro Asn 260 265 270 Ala Arg Gly Ile Ile Ile Phe Ala Asn Glu Asp Asp Ile Arg Arg Val 275 280 285 Leu Glu Ala Ala Arg Gln Ala Asn Leu Thr Gly His Phe Leu Trp Val 290 295 300 Gly Ser Asp Ser Trp Gly Ala Lys Thr Ser Pro Ile Leu Ser Leu Glu 305 310 315 320 Asp Val Ala Val Gly Ala Ile Thr Ile Leu Pro Lys Arg Ala Ser Ile 325 330 335 Asp Gly Phe Asp Gln Tyr Phe Met Thr Arg Ser Leu Glu Asn Asn Arg 340 345 350 Arg Asn Ile Trp Phe Ala Glu Phe Trp Glu Glu Asn Phe Asn Cys Lys 355 360 365 Leu Thr Ser Ser Gly Thr Gln Ser Asp Asp Ser Thr Arg Lys Cys Thr 370 375 380 Gly Glu Glu Arg Ile Gly Arg Asp Ser Thr Tyr Glu Gln Glu Gly Lys 385 390 395 400 Val Gln Phe Val Ile Asp Ala Val Tyr Ala Ile Ala His Ala Leu His 405 410 415 Ser Met His Gln Ala Leu Cys Pro Gly His Thr Gly Leu Cys Pro Ala 420 425 430 Met Glu Pro Thr Asp Gly Arg Met Leu Leu Gln Tyr Ile Arg Ala Val 435 440 445 Arg Phe Asn Gly Ser Ala Gly Thr Pro Val Met Phe Asn Glu Asn Gly 450 455 460 Asp Ala Pro Gly Arg Tyr Asp Ile Phe Gln Tyr Gln Ala Thr Asn Gly 465 470 475 480 Ser Ala Ser Ser Gly Gly Tyr Gln Ala Val Gly Gln Trp Ala Glu Thr 485 490 495 Leu Arg Leu Asp Val Glu Ala Leu Gln Trp Ser Gly Asp Pro His Glu 500 505 510 Val Pro Ser Ser Leu Cys Ser Leu Pro Cys Gly Pro Gly Glu Arg Lys 515 520 525 Lys Met Val Lys Gly Val Pro Cys Cys Trp His Cys Glu Ala Cys Asp 530 535 540 Gly Tyr Arg Phe Gln Val Asp Glu Phe Thr Cys Glu Ala Cys Pro Gly 545 550 555 560 Asp Met Arg Pro Thr Pro Asn His Thr Gly Cys Arg Pro Thr Pro Val 565 570 575 Val Arg Leu Ser Trp Ser Ser Pro Trp Ala Ala Pro Pro Leu Leu Leu 580 585 590 Ala Val Leu Gly Ile Val Ala Thr Thr Thr Val Val Ala Thr Phe Val 595 600 605 Arg Tyr Asn Asn Thr Pro Ile Val Arg Ala Ser Gly Arg Glu Leu Ser 610 615 620 Tyr Val Leu Leu Thr Gly Ile Phe Leu Ile Tyr Ala Ile Thr Phe Leu 625 630 635 640 Met Val Ala Glu Pro Gly Ala Ala Val Cys Ala Ala Arg Arg Leu Phe 645 650 655 Leu Gly Leu Gly Thr Thr Leu Ser Tyr Ser Ala Leu Leu Thr Lys Thr 660 665 670 Asn Arg Ile Tyr Arg Ile Phe Glu Gln Gly Lys Arg Ser Val Thr Pro 675 680 685 Pro Pro Phe Ile Ser Pro Thr Ser Gln Leu Val Ile Thr Phe Ser Leu 690 695 700 Thr Ser Leu Gln Val Val Gly Met Ile Ala Trp Leu Gly Ala Arg Pro 705 710 715 720 Pro His Ser Val Ile Asp Tyr Glu Glu Gln Arg Thr Val Asp Pro Glu 725 730 735 Gln Ala Arg Gly Val Leu Lys Cys Asp Met Ser Asp Leu Ser Leu Ile 740 745 750 Gly Cys Leu Gly Tyr Ser Leu Leu Leu Met Val Thr Cys Thr Val Tyr 755 760 765 Ala Ile Lys Ala Arg Gly Val Pro Glu Thr Phe Asn Glu Ala Lys Pro 770 775 780 Ile Gly Phe Thr Met Tyr Thr Thr Cys Ile Ile Trp Leu Ala Phe Val 785 790 795 800 Pro Ile Phe Phe Gly Thr Ala Gln Ser Ala Glu Lys Ile Tyr Ile Gln 805 810 815 Thr Thr Thr Leu Thr Val Ser Leu Ser Leu Ser Ala Ser Val Ser Leu 820 825 830 Gly Met Leu Tyr Val Pro Lys Thr Tyr Val Ile Leu Phe His Pro Glu 835 840 845 Gln Asn Val Gln Lys Arg Lys Arg Ser Leu Lys Ala Thr Ser Thr Val 850 855 860 Ala Ala Pro Pro Lys Gly Glu Asp Ala Glu Ala His Lys 865 870 875 25 4021 DNA Homo sapiens 25 ggtcgcccct ccccggattc ccccaccctc cgtgcctgca ggagcccctg ggctttcccg 60 gaggagctcg ccctgaaggg cccggacctc ggcgagccca ccaccgttcc ctccagcgcc 120 gccgccgcca ccgcagcagc cggagcagca tggtccagct gaggaagctg ctccgcgtcc 180 tgactttgat gaagttcccc tgctgcgtgc tggaggtgct cctgtgcgcg ctggcggcgg 240 cggcgcgcgg ccaggagatg tacgccccgc actcaatccg gatcgagggg gacgtcaccc 300 tcggggggct gttccccgtg cacgccaagg gtcccagcgg agtgccctgc ggcgacatca 360 agagggaaaa cgggatccac aggctggaag cgatgctcta cgccctggac cagatcaaca 420 gtgatcccaa cctactgccc aacgtgacgc tgggcgcgcg gatcctggac acttgttcca 480 gggacactta cgcgctcgaa cagtcgctta ctttcgtcca ggcgctcatc cagaaggaca 540 cctccgacgt gcgctgcacc aacggcgaac cgccggtttt cgtcaagccg gagaaagtag 600 ttggagtgat tggggcttcg gggagttcgg tctccatcat ggtagccaac atcctgaggc 660 tcttccagat cccccagatt agttatgcat caacggcacc cgagctaagt gatgaccggc 720 gctatgactt cttctctcgc gtggtgccac ccgattcctt ccaagcccag gccatggtag 780 acattgtaaa ggccctaggc tggaattatg tgtctaccct cgcatcggaa ggaagttatg 840 gagagaaagg tgtggagtcc ttcacgcaga tttccaaaga ggcaggtgga ctctgcattg 900 cccagtccgt gagaatcccc caggaacgca aagacaggac cattgacttt gatagaatta 960 tcaaacagct cctggacacc cccaactcca gggccgtcgt gatttttgcc aacgatgagg 1020 atataaagca gatccttgca gcagccaaaa gagctgacca agttggccat tttctttggg 1080 tgggatcaga cagctgggga tccaaaataa acccactgca ccagcatgaa gatatcgcag 1140 aaggggccat caccattcag cccaagcgag ccacggtgga agggtttgat gcctacttta 1200 cgtcccgtac acttgaaaac aacagaagaa atgtatggtt tgccgaatac tgggaggaaa 1260 acttcaactg caagttgacg attagtgggt caaaaaaaga agacacagat cgcaaatgca 1320 caggacagga gagaattgga aaagattcca actatgagca ggagggtaaa gtccagttcg 1380 tgattgacgc agtctatgct atggctcacg cccttcacca catgaacaag gatctctgtg 1440 ctgactaccg gggtgtctgc ccagagatgg agcaagctgg aggcaagaag ttgctgaagt 1500 atatacgcaa tgttaatttc aatggtagtg ctggcactcc agtgatgttt aacaagaacg 1560 gggatgcacc tgggcgttat gacatctttc agtaccagac cacaaacacc agcaacccgg 1620 gttaccgtct gatcgggcag tggacagacg aacttcagct caatatagaa gacatgcagt 1680 ggggtaaagg agtccgagag atacccgcct cagtgtgcac actaccatgt aagccaggac 1740 agagaaagaa gacacagaaa ggaactcctt gctgttggac ctgtgagcct tgcgatggtt 1800 accagtacca gtttgatgag atgacatgcc agcattgccc ctatgaccag aggcccaatg 1860 aaaatcgaac cggatgccag gatattccca tcatcaaact ggagtggcac tccccctggg 1920 ctgtgattcc tgtcttcctg gcaatgttgg ggatcattgc caccatcttt gtcatggcca 1980 ctttcatccg ctacaatgac acgcccattg tccgggcatc tgggcgggaa ctcagctatg 2040 ttcttttgac gggcatcttt ctttgctaca tcatcacttt cctgatgatt gccaaaccag 2100 atgtggcagt gtgttctttc cggcgagttt tcttgggctt gggtatgtgc atcagttatg 2160 cagccctctt gacgaaaaca aatcggattt atcgcatatt tgagcagggc aagaaatcag 2220 taacagctcc cagactcata agcccaacat cacaactggc aatcacttcc agtttaatat 2280 cagttcagct tctaggggtg ttcatttggt ttggtgttga tccacccaac atcatcatag 2340 actatgatga acacaagaca atgaaccctg agcaagccag aggggttctc aagtgtgaca 2400 ttacagatct ccaaatcatt tgctccttgg gatatagcat tcttctcatg gtcacatgta 2460 ctgtgtatgc catcaagact cggggtgtac ccgagaattt taacgaagcc aagcccattg 2520 gattcactat gtacacgaca tgtatagtat ggcttgcctt cattccaatt ttttttggca 2580 ccgctcaatc agcggaaaag ctctacatac aaactaccac gcttacaatc tccatgaacc 2640 taagtgcatc agtggcgctg gggatgctat acatgccgaa agtgtacatc atcattttcc 2700 accctgaact caatgtccag aaacggaagc gaagcttcaa ggcggtagtc acagcagcca 2760 ccatgtcatc gaggctgtca cacaaaccca gtgacagacc caacggtgag gcaaagaccg 2820 agctctgtga aaacgtagac ccaaacagcc ctgctgcaaa aaagaagtat gtcagttata 2880 ataacctggt tatctaacct gttccattcc atggaaccat ggaggaggaa gaccctcagt 2940 tattttgtca cccaacctgg cataggactc tttggtccta cccgcttccc atcaccggag 3000 gagcttcccc ggccgggaga ccagtgttag aggatccaag cgacctaaac agctgcttta 3060 tgaaatatcc ttactttatc tgggcttaat aagtcactga catcagcact gccaactcgg 3120 ctgcaattgt ggaccttccc taccaaaggg agtgttgaaa ctcaagtccc gccctggctc 3180 tttagaatgg accactgaga gccacaggac cgttttgggg ctgacctgtc ttattacgta 3240 tgtacttcta ggttgcaagg ttttgaaatt ttctgtacag tttgtgagga cctttgcact 3300 ttgccatctg atgtcgtacc tcggttcact gtttgttttc gaatgccttg ttttcataga 3360 gccctattct ctcagacggt ggaatatttg gaaaaatttt aaaacaatta aaattttaaa 3420 gcaatcttgg cagactaaaa caagtacatc tgtacatgac tgtataatta cgattatagt 3480 accactgcac atcatgtttt tttttttaag acaaaaaaga tgtttaaaga ccaaaaactg 3540 tgctgagaaa gtatgcccca cctatctttg gtatatgata ggttacataa aaggaaggta 3600 ttggctgaac tgaatagagg tcttgatctt tggaatgcat gccagtaatg tattttacag 3660 tacatgttta ttatgttcaa tatttgtatt tgtgttctct tttgttattt ttaattaggg 3720 tatatgaata ttttgcaata attttaataa ttattaagct gtttgaagga aagaatatgg 3780 atttttcatg tcttgaggtt ttgttcatgc cccctttgac tgatcagtgt gataaggact 3840 ttaggaaaaa aagcatgtat gttttttact gtttgtaata agtactttcg ttaatcttgc 3900 tgcttatgtg ccaatttagt ggaaaaaaac aacccttgct gaaaaattcc ctctttccat 3960 tctctttcaa ttctgtgata ttgtccaaga atgtatcaat aaaatacttt ggttaacttt 4020 t 4021 26 915 PRT Homo sapiens 26 Met Val Gln Leu Arg Lys Leu Leu Arg Val Leu Thr Leu Met Lys Phe 1 5 10 15 Pro Cys Cys Val Leu Glu Val Leu Leu Cys Ala Leu Ala Ala Ala Ala 20 25 30 Arg Gly Gln Glu Met Tyr Ala Pro His Ser Ile Arg Ile Glu Gly Asp 35 40 45 Val Thr Leu Gly Gly Leu Phe Pro Val His Ala Lys Gly Pro Ser Gly 50 55 60 Val Pro Cys Gly Asp Ile Lys Arg Glu Asn Gly Ile His Arg Leu Glu 65 70 75 80 Ala Met Leu Tyr Ala Leu Asp Gln Ile Asn Ser Asp Pro Asn Leu Leu 85 90 95 Pro Asn Val Thr Leu Gly Ala Arg Ile Leu Asp Thr Cys Ser Arg Asp 100 105 110 Thr Tyr Ala Leu Glu Gln Ser Leu Thr Phe Val Gln Ala Leu Ile Gln 115 120 125 Lys Asp Thr Ser Asp Val Arg Cys Thr Asn Gly Glu Pro Pro Val Phe 130 135 140 Val Lys Pro Glu Lys Val Val Gly Val Ile Gly Ala Ser Gly Ser Ser 145 150 155 160 Val Ser Ile Met Val Ala Asn Ile Leu Arg Leu Phe Gln Ile Pro Gln 165 170 175 Ile Ser Tyr Ala Ser Thr Ala Pro Glu Leu Ser Asp Asp Arg Arg Tyr 180 185 190 Asp Phe Phe Ser Arg Val Val Pro Pro Asp Ser Phe Gln Ala Gln Ala 195 200 205 Met Val Asp Ile Val Lys Ala Leu Gly Trp Asn Tyr Val Ser Thr Leu 210 215 220 Ala Ser Glu Gly Ser Tyr Gly Glu Lys Gly Val Glu Ser Phe Thr Gln 225 230 235 240 Ile Ser Lys Glu Ala Gly Gly Leu Cys Ile Ala Gln Ser Val Arg Ile 245 250 255 Pro Gln Glu Arg Lys Asp Arg Thr Ile Asp Phe Asp Arg Ile Ile Lys 260 265 270 Gln Leu Leu Asp Thr Pro Asn Ser Arg Ala Val Val Ile Phe Ala Asn 275 280 285 Asp Glu Asp Ile Lys Gln Ile Leu Ala Ala Ala Lys Arg Ala Asp Gln 290 295 300 Val Gly His Phe Leu Trp Val Gly Ser Asp Ser Trp Gly Ser Lys Ile 305 310 315 320 Asn Pro Leu His Gln His Glu Asp Ile Ala Glu Gly Ala Ile Thr Ile 325 330 335 Gln Pro Lys Arg Ala Thr Val Glu Gly Phe Asp Ala Tyr Phe Thr Ser 340 345 350 Arg Thr Leu Glu Asn Asn Arg Arg Asn Val Trp Phe Ala Glu Tyr Trp 355 360 365 Glu Glu Asn Phe Asn Cys Lys Leu Thr Ile Ser Gly Ser Lys Lys Glu 370 375 380 Asp Thr Asp Arg Lys Cys Thr Gly Gln Glu Arg Ile Gly Lys Asp Ser 385 390 395 400 Asn Tyr Glu Gln Glu Gly Lys Val Gln Phe Val Ile Asp Ala Val Tyr 405 410 415 Ala Met Ala His Ala Leu His His Met Asn Lys Asp Leu Cys Ala Asp 420 425 430 Tyr Arg Gly Val Cys Pro Glu Met Glu Gln Ala Gly Gly Lys Lys Leu 435 440 445 Leu Lys Tyr Ile Arg Asn Val Asn Phe Asn Gly Ser Ala Gly Thr Pro 450 455 460 Val Met Phe Asn Lys Asn Gly Asp Ala Pro Gly Arg Tyr Asp Ile Phe 465 470 475 480 Gln Tyr Gln Thr Thr Asn Thr Ser Asn Pro Gly Tyr Arg Leu Ile Gly 485 490 495 Gln Trp Thr Asp Glu Leu Gln Leu Asn Ile Glu Asp Met Gln Trp Gly 500 505 510 Lys Gly Val Arg Glu Ile Pro Ala Ser Val Cys Thr Leu Pro Cys Lys 515 520 525 Pro Gly Gln Arg Lys Lys Thr Gln Lys Gly Thr Pro Cys Cys Trp Thr 530 535 540 Cys Glu Pro Cys Asp Gly Tyr Gln Tyr Gln Phe Asp Glu Met Thr Cys 545 550 555 560 Gln His Cys Pro Tyr Asp Gln Arg Pro Asn Glu Asn Arg Thr Gly Cys 565 570 575 Gln Asp Ile Pro Ile Ile Lys Leu Glu Trp His Ser Pro Trp Ala Val 580 585 590 Ile Pro Val Phe Leu Ala Met Leu Gly Ile Ile Ala Thr Ile Phe Val 595 600 605 Met Ala Thr Phe Ile Arg Tyr Asn Asp Thr Pro Ile Val Arg Ala Ser 610 615 620 Gly Arg Glu Leu Ser Tyr Val Leu Leu Thr Gly Ile Phe Leu Cys Tyr 625 630 635 640 Ile Ile Thr Phe Leu Met Ile Ala Lys Pro Asp Val Ala Val Cys Ser 645 650 655 Phe Arg Arg Val Phe Leu Gly Leu Gly Met Cys Ile Ser Tyr Ala Ala 660 665 670 Leu Leu Thr Lys Thr Asn Arg Ile Tyr Arg Ile Phe Glu Gln Gly Lys 675 680 685 Lys Ser Val Thr Ala Pro Arg Leu Ile Ser Pro Thr Ser Gln Leu Ala 690 695 700 Ile Thr Ser Ser Leu Ile Ser Val Gln Leu Leu Gly Val Phe Ile Trp 705 710 715 720 Phe Gly Val Asp Pro Pro Asn Ile Ile Ile Asp Tyr Asp Glu His Lys 725 730 735 Thr Met Asn Pro Glu Gln Ala Arg Gly Val Leu Lys Cys Asp Ile Thr 740 745 750 Asp Leu Gln Ile Ile Cys Ser Leu Gly Tyr Ser Ile Leu Leu Met Val 755 760 765 Thr Cys Thr Val Tyr Ala Ile Lys Thr Arg Gly Val Pro Glu Asn Phe 770 775 780 Asn Glu Ala Lys Pro Ile Gly Phe Thr Met Tyr Thr Thr Cys Ile Val 785 790 795 800 Trp Leu Ala Phe Ile Pro Ile Phe Phe Gly Thr Ala Gln Ser Ala Glu 805 810 815 Lys Leu Tyr Ile Gln Thr Thr Thr Leu Thr Ile Ser Met Asn Leu Ser 820 825 830 Ala Ser Val Ala Leu Gly Met Leu Tyr Met Pro Lys Val Tyr Ile Ile 835 840 845 Ile Phe His Pro Glu Leu Asn Val Gln Lys Arg Lys Arg Ser Phe Lys 850 855 860 Ala Val Val Thr Ala Ala Thr Met Ser Ser Arg Leu Ser His Lys Pro 865 870 875 880 Ser Asp Arg Pro Asn Gly Glu Ala Lys Thr Glu Leu Cys Glu Asn Val 885 890 895 Asp Pro Asn Ser Pro Ala Ala Lys Lys Lys Tyr Val Ser Tyr Asn Asn 900 905 910 Leu Val Ile 915 27 3321 DNA Homo sapiens 27 tgctgtgttg caagaataaa ctttgggtct tggattgcaa taccacctgt ggagaaaatg 60 gtatgcgagg gaaagcgatc agcctcttgc ccttgtttct tcctcttgac cgccaagttc 120 tactggatcc tcacaatgat gcaaagaact cacagccagg agtatgccca ttccatacgg 180 gtggatgggg acattatttt ggggggtctc ttccctgtcc acgcaaaggg agagagaggg 240 gtgccttgtg gggagctgaa gaaggaaaag gggattcaca gactggaggc catgctttat 300 gcaattgacc agattaacaa ggaccctgat ctcctttcca acatcactct gggtgtccgc 360 atcctcgaca cgtgctctag ggacacctat gctttggagc agtctctaac attcgtgcag 420 gcattaatag agaaagatgc ttcggatgtg aagtgtgcta atggagatcc acccattttc 480 accaagcccg acaagatttc tggcgtcata ggtgctgcag caagctccgt gtccatcatg 540 gttgctaaca ttttaagact ttttaagata cctcaaatca gctatgcatc cacagcccca 600 gagctaagtg ataacaccag gtatgacttt ttctctcgag tggttccgcc tgactcctac 660 caagcccaag ccatggtgga catcgtgaca gcactgggat ggaattatgt ttcgacactg 720 gcttctgagg ggaactatgg tgagagcggt gtggaggcct tcacccagat ctcgagggag 780 attggtggtg tttgcattgc tcagtcacag aaaatcccac gtgaaccaag acctggagaa 840 tttgaaaaaa ttatcaaacg cctgctagaa acacctaatg ctcgagcagt gattatgttt 900 gccaatgagg atgacatcag gaggatattg gaagcagcaa aaaaactaaa ccaaagtggg 960 cattttctct ggattggctc agatagttgg ggatccaaaa tagcacctgt ctatcagcaa 1020 gaggagattg cagaaggggc tgtgacaatt ttgcccaaac gagcatcaat tgatggattt 1080 gatcgatact ttagaagccg aactcttgcc aataatcgaa gaaatgtgtg gtttgcagaa 1140 ttctgggagg agaattttgg ctgcaagtta ggatcacatg ggaaaaggaa cagtcatata 1200 aagaaatgca cagggctgga gcgaattgct cgggattcat cttatgaaca ggaaggaaag 1260 gtccaatttg taattgatgc tgtatattcc atggcttacg ccctgcacaa tatgcacaaa 1320 gatctctgcc ctggatacat tggcctttgt ccacgaatga gtaccattga tgggaaagag 1380 ctacttggtt atattcgggc tgtaaatttt aatggcagtg ctggcactcc tgtcactttt 1440 aatgaaaacg gagatgctcc tggacgttat gatatcttcc agtatcaaat aaccaacaaa 1500 agcacagagt acaaagtcat cggccactgg accaatcagc ttcatctaaa agtggaagac 1560 atgcagtggg ctcatagaga acatactcac ccggcgtctg tctgcagcct gccgtgtaag 1620 ccaggggaga ggaagaaaac ggtgaaaggg gtcccttgct gctggcactg tgaacgctgt 1680 gaaggttaca actaccaggt ggatgagctg tcctgtgaac tttgccctct ggatcagaga 1740 cccaacatga accgcacagg ctgccagctt atccccatca tcaaattgga gtggcattct 1800 ccctgggctg tggtgcctgt gtttgttgca atattgggaa tcatcgccac cacctttgtg 1860 atcgtgacct ttgtccgcta taatgacaca cctatcgtga gggcttcagg acgcgaactt 1920 agttacgtgc tcctaacggg gatttttctc tgttattcaa tcacgttttt aatgattgca 1980 gcaccagata caatcatatg ctccttccga cgggtcttcc taggacttgg catgtgtttc 2040 agctatgcag cccttctgac caaaacaaac cgtatccacc gaatatttga gcaggggaag 2100 aaatctgtca cagcgcccaa gttcattagt ccagcatctc agctggtgat caccttcagc 2160 ctcatctccg tccagctcct tggagtgttt gtctggtttg ttgtggatcc cccccacatc 2220 atcattgact atggagagca gcggacacta gatccagaga aggccagggg agtgctcaag 2280 tgtgacattt ctgatctctc actcatttgt tcacttggat acagtatcct cttgatggtc 2340 acttgtactg tttatgccaa taaaacgaga ggtgtcccag agactttcaa tgaagccaaa 2400 cctattggat ttaccatgta taccacctgc atcatttggt tagctttcat ccccatcttt 2460 tttggtacag cccagtcagc agaaaagatg tacatccaga caacaacact tactgtctcc 2520 atgagtttaa gtgcttcagt atctctgggc atgctctata tgcccaaggt ttatattata 2580 atttttcatc cagaacagaa tgttcaaaaa cgcaagagga gcttcaaggc tgtggtgaca 2640 gctgccacca tgcaaagcaa actgatccaa aaaggaaatg acagaccaaa tggcgaggtg 2700 aaaagtgaac tctgtgagag tcttgaaacc aacacttcct ctaccaagac aacatatatc 2760 agttacagca atcattcaat ctgaaacagg gaaatggcac aatctgaaga gacgtggtat 2820 atgatcttaa atgatgaaca tgagaccgca aaaattcact cctggagatc tccgtagact 2880 acaatcaatc aaatcaatag tcagtcttgt aaggaacaaa aattagccat gagccaaaag 2940 tatcaataaa cggggagtga agaaacccgt tttatacaat aaaaccaatg agtgtcaagc 3000 taaagtattg cttattcatg agcagttaaa acaaatcaca aaaggaaaac taatgttagc 3060 tcgtgaaaaa aatgctgttg aaataaataa tgtctgatgt tattcttgta tttttctgtg 3120 attgtgagaa ctcccgttcc tgtcccacat tgtttaactt gtataagaca atgagtctgt 3180 ttcttgtaat ggctgaccag attgaagccc tgggttgtgc taaaaataaa tgcaatgatt 3240 gatgcatgca attttttata caaataattt atttctaata ataaaggaat gttttgcaaa 3300 aaaaaaaaaa aaaaactcga g 3321 28 908 PRT Homo sapiens 28 Met Val Cys Glu Gly Lys Arg Ser Ala Ser Cys Pro Cys Phe Phe Leu 1 5 10 15 Leu Thr Ala Lys Phe Tyr Trp Ile Leu Thr Met Met Gln Arg Thr His 20 25 30 Ser Gln Glu Tyr Ala His Ser Ile Arg Val Asp Gly Asp Ile Ile Leu 35 40 45 Gly Gly Leu Phe Pro Val His Ala Lys Gly Glu Arg Gly Val Pro Cys 50 55 60 Gly Glu Leu Lys Lys Glu Lys Gly Ile His Arg Leu Glu Ala Met Leu 65 70 75 80 Tyr Ala Ile Asp Gln Ile Asn Lys Asp Pro Asp Leu Leu Ser Asn Ile 85 90 95 Thr Leu Gly Val Arg Ile Leu Asp Thr Cys Ser Arg Asp Thr Tyr Ala 100 105 110 Leu Glu Gln Ser Leu Thr Phe Val Gln Ala Leu Ile Glu Lys Asp Ala 115 120 125 Ser Asp Val Lys Cys Ala Asn Gly Asp Pro Pro Ile Phe Thr Lys Pro 130 135 140 Asp Lys Ile Ser Gly Val Ile Gly Ala Ala Ala Ser Ser Val Ser Ile 145 150 155 160 Met Val Ala Asn Ile Leu Arg Leu Phe Lys Ile Pro Gln Ile Ser Tyr 165 170 175 Ala Ser Thr Ala Pro Glu Leu Ser Asp Asn Thr Arg Tyr Asp Phe Phe 180 185 190 Ser Arg Val Val Pro Pro Asp Ser Tyr Gln Ala Gln Ala Met Val Asp 195 200 205 Ile Val Thr Ala Leu Gly Trp Asn Tyr Val Ser Thr Leu Ala Ser Glu 210 215 220 Gly Asn Tyr Gly Glu Ser Gly Val Glu Ala Phe Thr Gln Ile Ser Arg 225 230 235 240 Glu Ile Gly Gly Val Cys Ile Ala Gln Ser Gln Lys Ile Pro Arg Glu 245 250 255 Pro Arg Pro Gly Glu Phe Glu Lys Ile Ile Lys Arg Leu Leu Glu Thr 260 265 270 Pro Asn Ala Arg Ala Val Ile Met Phe Ala Asn Glu Asp Asp Ile Arg 275 280 285 Arg Ile Leu Glu Ala Ala Lys Lys Leu Asn Gln Ser Gly His Phe Leu 290 295 300 Trp Ile Gly Ser Asp Ser Trp Gly Ser Lys Ile Ala Pro Val Tyr Gln 305 310 315 320 Gln Glu Glu Ile Ala Glu Gly Ala Val Thr Ile Leu Pro Lys Arg Ala 325 330 335 Ser Ile Asp Gly Phe Asp Arg Tyr Phe Arg Ser Arg Thr Leu Ala Asn 340 345 350 Asn Arg Arg Asn Val Trp Phe Ala Glu Phe Trp Glu Glu Asn Phe Gly 355 360 365 Cys Lys Leu Gly Ser His Gly Lys Arg Asn Ser His Ile Lys Lys Cys 370 375 380 Thr Gly Leu Glu Arg Ile Ala Arg Asp Ser Ser Tyr Glu Gln Glu Gly 385 390 395 400 Lys Val Gln Phe Val Ile Asp Ala Val Tyr Ser Met Ala Tyr Ala Leu 405 410 415 His Asn Met His Lys Asp Leu Cys Pro Gly Tyr Ile Gly Leu Cys Pro 420 425 430 Arg Met Ser Thr Ile Asp Gly Lys Glu Leu Leu Gly Tyr Ile Arg Ala 435 440 445 Val Asn Phe Asn Gly Ser Ala Gly Thr Pro Val Thr Phe Asn Glu Asn 450 455 460 Gly Asp Ala Pro Gly Arg Tyr Asp Ile Phe Gln Tyr Gln Ile Thr Asn 465 470 475 480 Lys Ser Thr Glu Tyr Lys Val Ile Gly His Trp Thr Asn Gln Leu His 485 490 495 Leu Lys Val Glu Asp Met Gln Trp Ala His Arg Glu His Thr His Pro 500 505 510 Ala Ser Val Cys Ser Leu Pro Cys Lys Pro Gly Glu Arg Lys Lys Thr 515 520 525 Val Lys Gly Val Pro Cys Cys Trp His Cys Glu Arg Cys Glu Gly Tyr 530 535 540 Asn Tyr Gln Val Asp Glu Leu Ser Cys Glu Leu Cys Pro Leu Asp Gln 545 550 555 560 Arg Pro Asn Met Asn Arg Thr Gly Cys Gln Leu Ile Pro Ile Ile Lys 565 570 575 Leu Glu Trp His Ser Pro Trp Ala Val Val Pro Val Phe Val Ala Ile 580 585 590 Leu Gly Ile Ile Ala Thr Thr Phe Val Ile Val Thr Phe Val Arg Tyr 595 600 605 Asn Asp Thr Pro Ile Val Arg Ala Ser Gly Arg Glu Leu Ser Tyr Val 610 615 620 Leu Leu Thr Gly Ile Phe Leu Cys Tyr Ser Ile Thr Phe Leu Met Ile 625 630 635 640 Ala Ala Pro Asp Thr Ile Ile Cys Ser Phe Arg Arg Val Phe Leu Gly 645 650 655 Leu Gly Met Cys Phe Ser Tyr Ala Ala Leu Leu Thr Lys Thr Asn Arg 660 665 670 Ile His Arg Ile Phe Glu Gln Gly Lys Lys Ser Val Thr Ala Pro Lys 675 680 685 Phe Ile Ser Pro Ala Ser Gln Leu Val Ile Thr Phe Ser Leu Ile Ser 690 695 700 Val Gln Leu Leu Gly Val Phe Val Trp Phe Val Val Asp Pro Pro His 705 710 715 720 Ile Ile Ile Asp Tyr Gly Glu Gln Arg Thr Leu Asp Pro Glu Lys Ala 725 730 735 Arg Gly Val Leu Lys Cys Asp Ile Ser Asp Leu Ser Leu Ile Cys Ser 740 745 750 Leu Gly Tyr Ser Ile Leu Leu Met Val Thr Cys Thr Val Tyr Ala Asn 755 760 765 Lys Thr Arg Gly Val Pro Glu Thr Phe Asn Glu Ala Lys Pro Ile Gly 770 775 780 Phe Thr Met Tyr Thr Thr Cys Ile Ile Trp Leu Ala Phe Ile Pro Ile 785 790 795 800 Phe Phe Gly Thr Ala Gln Ser Ala Glu Lys Met Tyr Ile Gln Thr Thr 805 810 815 Thr Leu Thr Val Ser Met Ser Leu Ser Ala Ser Val Ser Leu Gly Met 820 825 830 Leu Tyr Met Pro Lys Val Tyr Ile Ile Ile Phe His Pro Glu Gln Asn 835 840 845 Val Gln Lys Arg Lys Arg Ser Phe Lys Ala Val Val Thr Ala Ala Thr 850 855 860 Met Gln Ser Lys Leu Ile Gln Lys Gly Asn Asp Arg Pro Asn Gly Glu 865 870 875 880 Val Lys Ser Glu Leu Cys Glu Ser Leu Glu Thr Asn Thr Ser Ser Thr 885 890 895 Lys Thr Thr Tyr Ile Ser Tyr Ser Asn His Ser Ile 900 905 29 499 DNA Rattus sp. 29 aaaaggcaaa ttactgtatt tttatggcag gaagaagaaa aagtgttcaa acggtttctg 60 acgaagtcag gtatttaaat ggtgaatgac gatgtgttag tggagatgaa atgaaccaat 120 aaatgattga ttgtcattta tgcaggaaaa taatgctcct tttcaatata actaaacaga 180 gactaattta taagtgcttt attgaaaaat acacatattt tcatataaaa ttacagtagc 240 ggtacgaaga ggtttctacg tactttgcac agcactctgg tggccagtgc caggctgtag 300 gttgtcagct gggtctttgg agcggtatga agtcacctag taactttgtt tacagtgatc 360 caatttaaat tgaattttct ccttaggatt attaatccaa cttaaaaaat tacttgataa 420 taatgattaa taaagatatg tgtagataat caatagctat taaatcttct aatttgtgtc 480 aatggtacca tgtactaat 499 30 1401 DNA Rattus sp. 30 atgcaagagg tggggtctct gcaggtgtca cagttccctt cactgaccac cagtctgggg 60 cattgtctgt cctcacagac agcttgtaat aagagtgttc ctgtccccac agcacgtttt 120 ctggagcagt tgaaggctga gtgtcactac gtcaagggga gggagcatgt gtggagcgtg 180 accagattca tctataacca ggaagagttt gcccgctttg acagtgtctt tgggaagttc 240 ctggcagtga ctgagctggg gcggcccata gctgagtact tgaacaccca gaaggacatg 300 ctggacaatt accgtgcctc tgtggacagg tgcagaaata actatgacct ggttgatatc 360 ttcatgtcga acttaaaagc taaacccaag gtgaccgtgt acccttcaaa gacgcagccc 420 ctggaatacc acaacctcct ggtctgctct gtgagtgact tctaccctgg caccattgaa 480 atcagatggt tccggaatgg tgaggaggaa aagactggag tcgtgtccac cgacctgatc 540 tctaatggag actggaccta ccagaccctg gtgatgctgg agacggttcc tcagggtgga 600 gaggtttaca cctgccaggt ggagcatccc agcctgacca gccctgtcag agtggagtgg 660 agggctcgat ccacatctgc acagaacaag atgctgagcg gagccatggg catggcgcta 720 ggtctgttca tcctcgcggt ggggctgttc atctacttaa ggaatctgag agaggcttcc 780 ctggacaaag agctgtacta ccatggggaa cccctcaatg tcaacgtcca cgtcaccaac 840 aattctgcca agaccgtcaa gaagatcaga gtgtctgtga gacagtatgc cgacatttgc 900 ctcttcagca ctgcgcagta caagtgtcct gtggctcagc ttgaacaaga tgaccaggtg 960 tctcccagtt ccacattctg caaggtgtac accataaccc cgctgctcag tgacaaccga 1020 gagaagcgtg gccttgccct tgatgggcag ctcaagcacg aagacaccaa cctggcttcc 1080 agcaccattg tgaaggaggg agccaacaag gaggtgctgg gaatcctagt atcctacagg 1140 gtcaaggtga agctggtggt gtctcgaggc ggggatgtct ccgtggagct acctttcgtc 1200 ctaatgcacc ccaagcccca cgaccacatc acccttcccc gaccccagtc agccccccgg 1260 gaaatagaca tccctgtgga taccaacctc attgaattcg ataccaacta tgccacagac 1320 gacgacatcg tgtttgagga ctttgcgcgg cttcggctga aggggatgaa ggatgacgac 1380 tgtgatgacc agttctgcta g 1401 31 466 PRT Rattus sp. 31 Met Gln Glu Val Gly Ser Leu Gln Val Ser Gln Phe Pro Ser Leu Thr 1 5 10 15 Thr Ser Leu Gly His Cys Leu Ser Ser Gln Thr Ala Cys Asn Lys Ser 20 25 30 Val Pro Val Pro Thr Ala Arg Phe Leu Glu Gln Leu Lys Ala Glu Cys 35 40 45 His Tyr Val Lys Gly Arg Glu His Val Trp Ser Val Thr Arg Phe Ile 50 55 60 Tyr Asn Gln Glu Glu Phe Ala Arg Phe Asp Ser Val Phe Gly Lys Phe 65 70 75 80 Leu Ala Val Thr Glu Leu Gly Arg Pro Ile Ala Glu Tyr Leu Asn Thr 85 90 95 Gln Lys Asp Met Leu Asp Asn Tyr Arg Ala Ser Val Asp Arg Cys Arg 100 105 110 Asn Asn Tyr Asp Leu Val Asp Ile Phe Met Ser Asn Leu Lys Ala Lys 115 120 125 Pro Lys Val Thr Val Tyr Pro Ser Lys Thr Gln Pro Leu Glu Tyr His 130 135 140 Asn Leu Leu Val Cys Ser Val Ser Asp Phe Tyr Pro Gly Thr Ile Glu 145 150 155 160 Ile Arg Trp Phe Arg Asn Gly Glu Glu Glu Lys Thr Gly Val Val Ser 165 170 175 Thr Asp Leu Ile Ser Asn Gly Asp Trp Thr Tyr Gln Thr Leu Val Met 180 185 190 Leu Glu Thr Val Pro Gln Gly Gly Glu Val Tyr Thr Cys Gln Val Glu 195 200 205 His Pro Ser Leu Thr Ser Pro Val Arg Val Glu Trp Arg Ala Arg Ser 210 215 220 Thr Ser Ala Gln Asn Lys Met Leu Ser Gly Ala Met Gly Met Ala Leu 225 230 235 240 Gly Leu Phe Ile Leu Ala Val Gly Leu Phe Ile Tyr Leu Arg Asn Leu 245 250 255 Arg Glu Ala Ser Leu Asp Lys Glu Leu Tyr Tyr His Gly Glu Pro Leu 260 265 270 Asn Val Asn Val His Val Thr Asn Asn Ser Ala Lys Thr Val Lys Lys 275 280 285 Ile Arg Val Ser Val Arg Gln Tyr Ala Asp Ile Cys Leu Phe Ser Thr 290 295 300 Ala Gln Tyr Lys Cys Pro Val Ala Gln Leu Glu Gln Asp Asp Gln Val 305 310 315 320 Ser Pro Ser Ser Thr Phe Cys Lys Val Tyr Thr Ile Thr Pro Leu Leu 325 330 335 Ser Asp Asn Arg Glu Lys Arg Gly Leu Ala Leu Asp Gly Gln Leu Lys 340 345 350 His Glu Asp Thr Asn Leu Ala Ser Ser Thr Ile Val Lys Glu Gly Ala 355 360 365 Asn Lys Glu Val Leu Gly Ile Leu Val Ser Tyr Arg Val Lys Val Lys 370 375 380 Leu Val Val Ser Arg Gly Gly Asp Val Ser Val Glu Leu Pro Phe Val 385 390 395 400 Leu Met His Pro Lys Pro His Asp His Ile Thr Leu Pro Arg Pro Gln 405 410 415 Ser Ala Pro Arg Glu Ile Asp Ile Pro Val Asp Thr Asn Leu Ile Glu 420 425 430 Phe Asp Thr Asn Tyr Ala Thr Asp Asp Asp Ile Val Phe Glu Asp Phe 435 440 445 Ala Arg Leu Arg Leu Lys Gly Met Lys Asp Asp Asp Cys Asp Asp Gln 450 455 460 Phe Cys 465 32 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 32 gagcattggt gcagccagta 20 33 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 33 gtctgagaac aagacaaagg 20 34 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 34 ggtagaagcc tgctttaaac 20 35 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 35 ccaaggttct tcctcaacac 20 36 19 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 36 tgagagctgt caggagagc 19 37 21 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 37 ggcatgaatg aggaggccga c 21 38 39 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 38 tatttaggtg acactatagg agcattggtg cagccagta 39 39 39 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 39 tatttaggtg acactatagg tctgagaaca agacaaagg 39 40 39 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 40 tatttaggtg acactatagg gtagaagcct gctttaaac 39 41 39 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 41 taatacgact cactataggg gccaaggttc ttcctcaac 39 42 40 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 42 taatacgact cactataggg gtgagagctg tcaggagagc 40 43 42 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 43 taatacgact cactataggg gggcatgaat gaggaggccg ac 42 44 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 44 gagctggaca ccattgactc 20 45 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 45 gactgcgtct tggtcatttc 20 46 24 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 46 caacaccgaa tgcacgaaga catc 24 47 21 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 47 atgagtgcaa ggtaactctg g 21 48 20 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 48 tcacgtttcc aaggttcttc 20 49 24 DNA Artificial Sequence Description of Artificial Sequence Synthetic primer 49 ccaatggaaa gtcagctgac tgca 24 

1. A method of identifying a gene associated with a desired behavior in a mammal, comprising the steps of: (a) Providing a test population of mammals having the desired behavior; (b) Providing a control population of mammals lacking the desired behavior; (c) Isolating and pooling expressed RNA from neural tissue of the test population; (d) Isolating and pooling expressed RNA from neural tissue of the control population; (e) Determining the level of expression of a plurality of genes in each of the RNA pools created in steps (c ) and (d); and, (f) Selecting a gene from the plurality of genes, the expression of which differs between the test population and the control population of mammals, wherein the selected gene is a candidate gene associated with said desired behavior.
 2. A method of identifying a gene associated with cognitive function in a mammal, comprising the steps of: (a) Providing a test population of mammals having a desired cognitive function; (b) Providing a control population of mammals impaired in such cognitive function; (c) Isolating and pooling expressed RNA from neural tissue of the test population; (d) Isolating and pooling expressed RNA from neural tissue of the control population; (e) Determining the level of expression of a plurality of genes in each of the RNA pools created in steps (c ) and (d); and, (f) Selecting a gene from the plurality of genes, the expression of which differs between the test population and the control population of mammals, wherein the selected gene is a candidate gene associated with cognitive function.
 3. The method of claim 1 or 2 wherein the level of expression of said plurality of genes is detected by a method selected from the group consisting of: microarray analysis, in situ hybridization histochemistry, quantitative PCR, SAGE analysis, Northern blot analysis, and dot blot analysis.
 4. The method of claim 1 or 2 wherein said plurality of genes comprises a gene involved in glutamate transport.
 5. The method of claim 4 wherein said gene involved in glutamate transport is selected from the group consisting of: EAAT1, EAAT2, EAAT3, EAAT4, and EAAT5.
 6. The method of claim 1 or 2 wherein said plurality of genes comprises a gene other than a glutamate transporter selected from the group consisting of: EAAT1, EAAT2, EAAT3, EAAT4, and EAAT5.
 7. A method of screening compounds for utility in promoting cognitive function, comprising the steps of: (a) Administering a test compound to a mammal; (b) Determining the level of expression of a gene in neural tissue of said mammal following administration of said test compound; (c) Comparing said level of expression of said gene to a reference level of expression thereof in neural tissue of a mammal to whom said test compound was not administered; and, (d) Determining whether the level of expression of said gene differs from the corresponding reference level of expression thereof, wherein said difference indicates that the test compound is a candidate therapeutic agent for promoting cognitive function.
 8. The method of claim 7 comprising the further step of comparing said level of expression of said gene to a reference level of expression thereof in neural tissue of a mammal to whom ceftriaxone was administered.
 9. The method of claim 7 wherein the level of expression of said gene is detected by a method selected from the group consisting of: microarray analysis, in situ hybridization histochemistry, quantitative PCR, SAGE analysis, Northern blot analysis, and dot blot analysis.
 10. The method of claim 7 wherein said gene is a glutamate transporter.
 11. The method of claim 10 wherein said gene is selected from the group consisting of: EAAT1, EAAT2, EAAT3, EAAT4, and EAAT5.
 12. The method of claim 1, 2 or 7 wherein said neural tissue is hippocampal tissue.
 13. A method of screening compounds for utility in promoting cognitive function, comprising the steps of: (a) Administering a test compound to a mammal; (b) Determining the level of expression of a glutamate transporter gene in neural tissue of said mammal following administration of said test compound; (c) Comparing said level of expression of said gene to a reference level of expression thereof in neural tissue of a mammal to whom said test compound was not administered; and, (d) Determining whether the level of expression of said gene differs from the corresponding reference level of expression thereof, wherein said difference indicates that the test compound is a candidate therapeutic agent for promoting cognitive function.
 14. A method of screening compounds for utility in promoting cognitive function in a mammal, comprising the steps of: (a) Contacting a test compound with a cell expressing a gene listed in FIG. 4; and (b) Determining whether the level of expression of said gene is changed by contact of said cell with said test compound, said change if present being indicative of the ability of said compound to promote cognitive function in a mammal in need thereof.
 15. The method of claim 14 wherein said cell is derived from neural tissue.
 16. The method of claim 15 wherein said cell is an immortalized cell.
 17. The method of claim 16 wherein said cell is a neuronal cell line, a glial cell line, or an astrocyte cell line.
 18. The method of claim 14 wherein said gene is a glutamate transporter.
 19. The method of claim 1, 2, 7, 13, or 14 wherein the level of expression of said gene is increased.
 20. The method of claim 1, 2, 7, 13, or 14 wherein the level of expression of said gene is decreased.
 21. The method of claim 7, 13, or 14 wherein said test compound is a small molecule.
 22. The method of claim 21 wherein said test compound is:

wherein, individually for each occurrence: L is O or S; R is H, C₁₋₁₀ alkyl, C₁₋₁₀ alkoxy, aryl, aralkyl, —OCH₂CO₂H; R¹ is —(CH₂)_(n)—C(O)X wherein X is OH, NR₂, SH, O-alkali metal, or —OC(CH₃)OC(O)OCH(CH₃)₂; and n is an integer from 0 to 6 inclusive; R² is H, C₁₋₁₀ alkyl, C₂₋₈ alkenyl, or —(CH₂)_(a)—W—R³ wherein R³ is H, C₁₋₁₀ alkyl, —C(O)C₁₋₁₀ alkyl, —C(O)NR₂, aryl, aralkyl, or A; W is O, S, or NR⁴; and a is an integer from 1 to 6 inclusive; wherein R⁴ is H, C₁₋₁₀ alkyl, —C(O)C₁₋₁₀ alkyl, aryl, aralkyl, or R³ and R⁴ taken together may form an unsubstituted or substituted heteroalkyl or heteroaryl ring; the

line indicates either a single or double bond; R⁵ is R¹, H, SO₃H, aryl, C₁₋₁₀ alkyl, aralkyl; or R⁵ is selected from the group consisting of ═CHCH₂CO₂H and ═NR when the

line is a double bond; m is 0 or 1; and A is aryl or heteroaryl of formula Ia:

wherein, independently for each occurrence: J is O, S, NR⁶, or CR⁶; and y is 1 or2; wherein R⁶ is an electron pair, H, C₁₋₁₀ alkyl, C₁₋₁₀ alkoxy, aryl, or —NR₂; or A is heterocycloalkyl of formula Ib or Ic:

wherein, independently for each occurrence: J is O, S, or NR; and X is O or H₂.
 23. The method of claim 21 wherein said test compound is:

wherein, independently for each occurrence: X is —OH, C₁₋₁₀ alkoxy, —O-alkali metal, —N(R¹)₂, —SH, or —S—C₁₋₁₀ alkyl; R is a straight chain or branched C₁₋₃₀ alkyl; and R¹ is H, C₁₋₁₀ alky, C₂₋₁₀ alkenyl, C₂₋₁₀ alkynyl, aryl, or aralkyl; provided that R may be unsubstituted or substituted by one or more —OH, C₁₋₁₀ alkoxy, —N(R¹)₂, —SH, —S—C₁₋₁₀ alkyl, or aryl.
 24. A library comprising a plurality of cDNA sequences coding for genes that are differentially expressed in neural tissue upon preservation of cognitive function in a mammal.
 25. A library comprising a plurality of cDNA sequences coding for genes that are differentially expressed in neural tissue upon treatment of a mammal with ceftriaxone.
 26. A library comprising a plurality of cDNA sequences coding for genes that are differentially expressed in neural tissue upon treatment of a mammal with valproic acid.
 27. The library of claim 24, 25 or 26 wherein said plurality of cDNA sequences comprises a sequence derived from a glutamate transporter gene.
 28. The library of claim 27 wherein said glutamate transporter gene is selected from the group consisting of: EAAT1, EAAT2, EAAT3, EAAT4, and EAAT5.
 29. The library of claim 27 wherein said plurality of cDNA sequences comprises at least 20% of all sequences present therein.
 30. The library of claim 27 wherein said plurality of cDNA sequences comprises at least 50% of all sequences present therein.
 31. The library of claim 27 wherein said plurality of cDNA sequences comprises at least 80% of all sequences present therein.
 32. A microarray chip comprising a solid support having attached thereto, at individually addressed locations, cDNA sequences corresponding to members of the library of claims 24, 25 or
 26. 33. A pharmaceutical composition comprising a therapeutically effective amount of a compound that stimulates neural tissue expression of a gene listed in FIG.
 4. 34. The pharmaceutical composition of claim 33 wherein said gene is a glutamate transporter.
 35. The pharmaceutical composition of claim 34 wherein said glutamate transporter is EAAT1, EAAT2, EAAT3, EAAT4, or EAAT5.
 36. The pharmaceutical composition of claim 33 wherein said compound is a small molecule.
 37. A pharmaceutical composition comprising a therapeutically effective amount of:

wherein, individually for each occurrence: L is O or S; R is H, C₁₋₁₀ alkyl, C₁₋₁₀ alkoxy, aryl, aralkyl, —OCH₂CO₂H; R¹ is —(CH₂)_(n)—C(O)X wherein X is OH, NR₂, SH, O-alkali metal, or —OC(CH₃)OC(O)OCH(CH₃)₂; and n is an integer from 0 to 6 inclusive; R² is H, C₁₋₁₀ alkyl, C₂₋₈ alkenyl, or —(CH₂)_(a)—W—R³ wherein R³ is H, C₁₋₁₀ alkyl, —C(O)C₁₋₁₀ alkyl, —C(O)NR₂, aryl, aralkyl, or A; W is O, S, or NR⁴; and a is an integer from 1 to 6 inclusive; wherein R⁴ is H, C₁₋₁₀ alkyl, —C(O)C₁₋₁₀ alkyl, aryl, aralkyl, or R³ and R⁴ taken together may form an unsubstituted or substituted heteroalkyl or heteroaryl ring; the

line indicates either a single or double bond; R⁵ is R¹, H, SO₃H, aryl, C₁₋₁₀ alkyl, aralkyl; or R⁵ is selected from the group consisting of ═CHCH₂CO₂H and ═NR when the

line is a double bond; m is 0 or 1; and A is aryl or heteroaryl of formula Ia:

wherein, independently for each occurrence: J is O, S, NR⁶, or CR⁶; and y is 1 or 2; wherein R⁶ is an electron pair, H, C₁₋₁₀ alkyl, C₁₋₁₀ alkoxy, aryl, or —NR₂; or A is heterocycloalkyl of formula Ib or Ic:

wherein, independently for each occurrence: J is O, S, or NR; and X is O or H₂.
 38. A pharmaceutical composition comprising a therapeutically effective amount of:

wherein, independently for each occurrence: X is —OH, C₁₋₁₀ alkoxy, —O-alkali metal, —N(R¹)₂, —SH, or —S—C₁₋₁₀ alkyl; R is a straight chain or branched C₁₋₃₀ alkyl; and R¹ is H, C₁₋₁₀ alky, C₂₋₁₀ alkenyl, C₂₋₁₀ alkynyl, aryl, or aralkyl; provided that R may be unsubstituted or substituted by one or more —OH, C₁₋₁₀ alkoxy, —N(R¹)₂, —SH, —S—C₁₋₁₀ alkyl, or aryl.
 39. A pharmaceutical composition comprising a therapeutically effective amount of a therapeutic agent, other than ceftriaxone or valproic acid, identified according to the method of claim 7, 13, or
 14. 40. A method of preserving cognitive function in a mammal in need thereof, comprising the step of stimulating, in said mammal, neural tissue expression of a glutamate transporter gene.
 41. A method of treating impaired cognitive function in a mammal, comprising the step of stimulating, in said mammal, neural tissue expression of a glutamate transporter gene.
 42. A method of preserving cognitive function in a mammal in need thereof, comprising the step of administering a pharmaceutical composition of claim 33 to said mammal.
 43. A method of preserving cognitive function in a mammal in need thereof, comprising the step of administering a pharmaceutical composition of claim 37 to said mammal.
 44. A method of preserving cognitive function in a mammal in need thereof, comprising the step of administering a pharmaceutical composition of claim 38 to said mammal.
 45. A method of preserving cognitive function in a mammal in need thereof, comprising the step of administering a pharmaceutical composition of claim 39 to said mammal.
 46. The method of claim 43, wherein said mammal is free of symptoms of an infectious disease for which antibiotic treatment is indicated.
 47. A method of promoting cognitive function in a mammal in need thereof, comprising administering to said mammal an amount of a pharmaceutical composition of claim 33 sufficient to promote cognitive function selected from the group consisting of: spatial memory acquisition, long-term spatial memory and spatial memory retrieval.
 48. A method of preserving cognitive function in an aged mammal, comprising the step of administering a therapeutically effective amount of ceftriaxone or an analog or derivative thereof to said mammal.
 49. A method of treating impaired cognitive function in a mammal, comprising the step of administering a therapeutically effective amount of ceftriaxone or an analog or derivative thereof to said mammal.
 50. The method of claim 41 or 49 wherein said impaired cognitive function is a condition selected from the group consisting of: mild cognitive impairment, age related cognitive decline, memory loss, senility, and dementia.
 51. The method of claim 41 or 49 wherein said impaired cognitive function is Alzheimer's Disease.
 52. The method of claim 41 or 49 wherein said mammal is human.
 53. The method of claim 40, 42, 43, 44, 45, 46, 47, or 47 wherein said mammal is human. 